NON-ASYMPTOTIC ERROR BOUNDS FOR SEQUENTIAL MCMC AND 
STABILITY OF FEYNMAN-KAC PROPAGATORS 



NIKOLAUS SCHWEIZER 



Abstract. We provide a generic way of deducing non-asymptotic error bounds for Sequen- 
tial MCMC methods from suitable stability properties of Feynman-Kac propagators. We 
show how to derive this type of stability from mixing conditions for the MCMC dynam- 
ics, namely, spectral gaps and hyperboundedness, and from upper bounds on the relative 
densities in the sequence of distributions. 



1. Introduction 

Since the 1950s (Metropolis et al. [18]), Markov Chain Monte Carlo (MCMC) methods have
become an increasingly popular tool for challenging numerical integration problems in a wide
variety of fields ranging from chemical physics to financial econometrics (see, e.g., Liu [17]).
The basic idea is to approximate the integral of a function $f$ with respect to a probability
measure $\mu$ by simulating a Markov chain with stationary distribution $\mu$ and to calculate the
ergodic average of $f$ evaluated at the positions visited by the Markov chain. By construction,
MCMC methods only work well if the simulated Markov chain reaches equilibrium sufficiently
quickly. Roughly speaking, this is the case when $\mu$ is essentially unimodal and it is not the case
when $\mu$ is severely multimodal in the sense of being characterized by several well-separated
modes. In the latter case, MCMC methods tend to get stuck in local modes for very long
times and therefore approach their equilibrium $\mu$ only on time-scales well beyond those that
can feasibly be simulated. This metastability phenomenon is a serious drawback of standard
MCMC methods in many applications.

Sequential MCMC methods (see, e.g., Del Moral, Doucet and Jasra [8] and the references
therein) are a class of algorithms which try to overcome this problem. The basic idea is to
approximate the target distribution $\mu$ with a sequence of distributions $\mu_0, \ldots, \mu_n$ such that
$\mu_n = \mu$ is the actual target distribution and such that $\mu_0$ is easy to sample from. The aim
is to carry over the good mixing/sampling properties from $\mu_0$ to the target $\mu_n$ using
interpolating distributions $\mu_k$ where $\mu_k$ and $\mu_{k+1}$ are sufficiently similar to allow for, e.g.,
efficient importance sampling. The algorithm constructs a system of particles which sequentially
approximates the measures $\mu_0$ to $\mu_n$. The algorithm is initialized with $N$ independent
samples from $\mu_0$ and then alternates two types of steps, Importance Sampling Resampling and
MCMC: In the Importance Sampling Resampling steps, a cloud of particles approximating
$\mu_k$ is transformed into a cloud of particles approximating $\mu_{k+1}$ by randomly duplicating and
eliminating particles in a suitable way, depending on the relative density between $\mu_{k+1}$ and
$\mu_k$. This step is similar to the selection step in models of population genetics where particles
form the population and where the relative density takes the role of a fitness function guiding
the number of offspring a particle has. In the MCMC steps, particles move independently
according to an MCMC dynamics for the current target distribution in order to adjust better
to the changed environment. This step resembles the mutation step in models of population
genetics. The algorithm is essentially the same as the particle filter of Gordon, Salmond and
Smith [15], yet the application - sampling from a fixed target distribution instead of filtering
with an exogenous sequence of distributions - is different. The error analysis of this paper
focuses on a simple Sequential MCMC algorithm with Multinomial Resampling which is a
basic instance of the class of algorithms introduced in Del Moral, Doucet and Jasra [8].



This paper contributes to the literature on error bounds and stability analysis of Sequential
MCMC. Specifically, we are interested in proving explicit non-asymptotic error bounds
which are well-suited to the high-dimensional and non-compact state spaces prevalent in
many applications of Monte Carlo methods. Our two main results can be summarized as
follows: Theorem 3.2 provides a generic way for deriving non-asymptotic bounds on the mean
squared integration error from suitable stability properties of the Feynman-Kac propagator
associated with the particle system. The bound of the theorem applies to the so-called
unnormalized particle approximation but immediately implies a bound for the actual particle
approximation provided by the algorithm, see Lemma 2.2. The bound in the theorem is
sharp in the sense that the leading error term corresponds to the asymptotic variance of the
unnormalized particle approximation found, e.g., in the central limit theorem of Del Moral
and Miclo [9]. Theorem 4.5 derives $L_p$-stability bounds for the Feynman-Kac propagator
which can be applied in the framework of Theorem 3.2. Specifically, we derive stability in
$L_p$ from sufficiently good mixing of the MCMC dynamics in the sense of a sufficiently large
spectral gap and from uniform upper bounds on relative densities. $L_{2p}$-$L_p$-bounds, which
we need in our error bound for Sequential MCMC, follow under an additional assumption of
hyperboundedness for the MCMC dynamics.

Basically, our results are discrete-time, general state space analogues of the error bounds 
and stability results given, respectively, in Eberle and Marinelli [2], [13] who analyze a 
related continuous-time particle system with simultaneous resampling and MCMC on a finite 
state space. The generalization to discrete time is important since this corresponds to the 
algorithms implemented in practice. Moreover, in the discrete time setting a uniform lower 
bound on relative densities is not needed to derive the results. 

There is by now a sizeable literature on asymptotic error bounds for Sequential MCMC
and related particle systems beginning with the central limit theorems in Del Moral [6],
Chopin [4], Künsch [16] and Cappé, Moulines and Rydén [2]. See Del Moral [7] for an
overview and many results, and Douc and Moulines [11] for a recent contribution.
Non-asymptotic error bounds, i.e., error bounds for a fixed number of particles, are comparatively
less studied, see Del Moral and Miclo [9], Theorem 7.4.4 of Del Moral [7], Cérou, Del Moral
and Guyader [3] and the references in these for some results. These works rely on mixing
and boundedness conditions which will usually not be satisfied on a non-compact state space.
Namely, this concerns uniform upper and lower bounds on relative densities between $\mu_{k+1}$
and $\mu_k$ and uniformly bounded relative densities between MCMC transition kernels from
different starting points. To see the limitation of such conditions consider the case where
$\mu_k$ and $\mu_{k+1}$ are two different Gaussian distributions on $\mathbb{R}$. Then the relative density tends
either to zero or to infinity in the tails. Similarly, the transition kernels of most commonly
used MCMC dynamics will be almost singular for starting points which lie far apart.
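
The Gaussian example can be made explicit. As an illustrative assumption (the centred case and the symbols $\sigma_k$, $\sigma_{k+1}$ are chosen here, not taken from the text), let $\mu_k = N(0, \sigma_k^2)$ and $\mu_{k+1} = N(0, \sigma_{k+1}^2)$ with $\sigma_k \neq \sigma_{k+1}$. Then the relative density is

```latex
\[
\frac{d\mu_{k+1}}{d\mu_k}(x)
  = \frac{\sigma_k}{\sigma_{k+1}}
    \exp\!\Big( -\frac{x^2}{2} \Big( \frac{1}{\sigma_{k+1}^2} - \frac{1}{\sigma_k^2} \Big) \Big).
\]
```

For $\sigma_{k+1} < \sigma_k$ this is bounded from above by $\sigma_k / \sigma_{k+1}$ but tends to zero as $|x| \to \infty$, so no uniform lower bound exists; for $\sigma_{k+1} > \sigma_k$ it tends to infinity in the tails, so no uniform upper bound exists.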

To see why upper bounds on relative densities are less restrictive than lower bounds in
typical applications of Sequential MCMC, note that the most common way of choosing the
distributions $\mu_k$ is by setting $\mu_k(\cdot) \sim \exp(-\beta_k H(\cdot))$ for an increasing sequence $\beta_k$ of (artificial)
inverse temperature parameters, see, e.g., Neal [19]. Then the relative density from $\mu_k$ to
$\mu_{k+1}$ is proportional to $\exp(-(\beta_{k+1} - \beta_k) H(\cdot))$ which is bounded from above whenever the
Hamiltonian $H$ is bounded from below, which is the case in most applications.
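
As a short worked version of this argument, write $H_{\min}$ for a lower bound on $H$ (a symbol introduced here only for illustration). For $\beta_{k+1} \ge \beta_k$ one then has

```latex
\[
\frac{d\mu_{k+1}}{d\mu_k}(x)
  \;\propto\; e^{-(\beta_{k+1}-\beta_k) H(x)}
  \;\le\; e^{-(\beta_{k+1}-\beta_k) H_{\min}} \;<\; \infty
  \qquad \text{for all } x ,
\]
```

whereas a uniform lower bound would additionally require $H$ to be bounded from above, which typically fails on non-compact state spaces.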

Recently, Whiteley [22] proved non-asymptotic error bounds that can be expected to be 
applicable to non-trivial models with non-compact state spaces. In Whiteley's setting, the 
usual global mixing condition is replaced by a family of minorization conditions on a small set 
and drift conditions outside the small set, applying results of Douc, Moulines and Rosenthal 
[12]. These results are complementary to ours since they rely on a rather different set of
conditions. In addition, our results are more explicit, allowing one, e.g., to make rather precise
predictions about the dimension dependence of the algorithm. Whiteley's results have a 
somewhat wider scope, applying, e.g., to unbounded integrands and settings with an initial 
bias. 

The literature on non-asymptotic error bounds - including this contribution - has largely
focused on the case where the underlying MCMC dynamics mixes well globally for all the
distributions $\mu_k$. This is, of course, unfortunate since one main motivation for applying
a multilevel MCMC method such as Sequential MCMC is the presence of well-separated
modes in the target distribution which renders the application of simple MCMC dynamics
infeasible. An exception is Eberle and Marinelli [13], [14] who address in their continuous-time
framework the case in which the state space is decomposed into several components that
are not connected for all $\mu_k$. In a follow-up paper (Schweizer [21]), bounds for more general
multimodal cases are derived from Theorem 3.2 presented here.

The results presented here are extracted from a more detailed presentation in the
dissertation [20]. The paper is structured as follows: Section 2 introduces the setting and recalls
a number of results from the literature, chiefly, an explicit expression for the variance of the
reweighted particle approximation. Section 3 shows how to derive error bounds from suitable
stability properties of the Feynman-Kac propagator associated with the model. Section 4
shows how to derive these stability properties from mixing conditions of the MCMC dynamics.
Section 5 gives more explicit statements of our error bounds in terms of the constants in
Poincaré and Logarithmic-Sobolev inequalities. Finally, Section 6 gives a brief discussion of
the dependence of the algorithm's error on the dimension of the state space in light of our
results.



2. Preliminaries 

2.1. Notation. Let $(E, r)$ be a complete, separable metric space and let $\mathcal{B}(E)$ be the
$\sigma$-algebra of Borel subsets of $E$. Denote by $M(E)$ the space of finite signed Borel measures on
$E$. Let $M_1(E) \subset M(E)$ be the subset of all probability measures. Let $B(E)$ be the space of
bounded, measurable, real-valued functions on $E$.
For $\mu \in M(E)$ and $f \in B(E)$ define $\mu(f)$ by

\[ \mu(f) = \int_E f(x)\, \mu(dx) \]

and $\mathrm{Var}_\mu(f)$ by

\[ \mathrm{Var}_\mu(f) = \mu(f^2) - \mu(f)^2 . \]

Let $(E', r')$ be another Polish space. Consider a kernel $K(x, A)$ with $K(x, \cdot) \in M(E')$ for $x \in E$
and $K(\cdot, A) \in B(E)$ for $A \in \mathcal{B}(E')$. We define for $\mu \in M(E)$ the measure $\mu K \in M(E')$ by

\[ \mu K(A) = \int_E K(x, A)\, \mu(dx) \quad \forall A \in \mathcal{B}(E') . \]

For $f \in B(E')$ we denote by $K(f) \in B(E)$ the function given by

\[ K(f)(x) = K(x, f) = \int_{E'} f(z)\, K(x, dz) \quad \forall x \in E . \]

2.2. The Measure-Valued Model. Consider a sequence of Polish spaces $(E_k, r_k)$ and a
sequence of probability measures $(\mu_k)_{k=0}^n$, $\mu_k \in M_1(E_k)$. This is the sequence of measures
we wish to approximate with the algorithm introduced in Section 2.3. The measures $\mu_k$ are
related through

\[ \mu_k(f) = \frac{\mu_{k-1}(g_{k-1,k} K_k(f))}{\mu_{k-1}(g_{k-1,k})} \quad \forall f \in B(E_k) \]

for positive functions $g_{k-1,k} \in B(E_{k-1})$ and transition kernels $K_k$ with $K_k(x, \cdot) \in M_1(E_k)$
for $x \in E_{k-1}$ and $K_k(\cdot, A) \in B(E_{k-1})$ for $A \in \mathcal{B}(E_k)$. We define the probability distribution
$\hat\mu_k \in M_1(E_{k-1})$ by

\[ \hat\mu_k(f) = \frac{\mu_{k-1}(g_{k-1,k} f)}{\mu_{k-1}(g_{k-1,k})} \quad \forall f \in B(E_{k-1}) . \]

This implies $\hat\mu_k(K_k(f)) = \mu_k(f)$ for $f \in B(E_k)$.

Next we introduce the Feynman-Kac propagator $q_{j,k}$ which will be the central object of our
error analysis. Define the mapping $q_{k-1,k} : B(E_k) \to B(E_{k-1})$ by

\[ q_{k-1,k}(f) = \frac{g_{k-1,k} K_k(f)}{\mu_{k-1}(g_{k-1,k})} . \]

Observe that this implies

\[ \mu_k(f) = \mu_{k-1}(q_{k-1,k}(f)) . \]

Furthermore define for $0 \le j < k \le n$ the mapping $q_{j,k} : B(E_k) \to B(E_j)$ by

\[ q_{j,k}(f) = q_{j,j+1}(q_{j+1,j+2}(\cdots q_{k-1,k}(f))) \]

and $q_{k,k}(f) = f$. Note that for $f \in B(E_k)$ we have the relation

\[ \mu_j(q_{j,k}(f)) = \mu_k(f) \quad \text{for } 0 \le j \le k \le n \]

and the semigroup property

\[ q_{j,l}(q_{l,k}(f)) = q_{j,k}(f) \quad \text{for } 0 \le j \le l \le k \le n . \]
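
On a finite state space, the propagator and the identity $\mu_j(q_{j,k}(f)) = \mu_k(f)$ can be checked directly. The following Python sketch is only an illustration under assumptions made here: the three-state space, the weight vectors `gs`, the transition matrices `Ks` and all function names are hypothetical and do not come from the paper.

```python
# Hypothetical finite-state illustration of the measure-valued model.
# Measures and functions are lists indexed by state; kernels are row-stochastic matrices.

def integrate(mu, f):
    """mu(f) = sum_x mu[x] * f[x]."""
    return sum(m * v for m, v in zip(mu, f))


def apply_kernel(K, f):
    """K(f)(x) = sum_z K[x][z] * f[z]."""
    return [sum(row[z] * f[z] for z in range(len(f))) for row in K]


def next_measure(mu_prev, g, K):
    """mu_k(A) = mu_{k-1}(g * K(., A)) / mu_{k-1}(g), computed state by state."""
    norm = integrate(mu_prev, g)
    return [
        sum(mu_prev[x] * g[x] * K[x][z] for x in range(len(mu_prev))) / norm
        for z in range(len(K[0]))
    ]


def propagator_step(mu_prev, g, K, f):
    """q_{k-1,k}(f) = g * K(f) / mu_{k-1}(g)."""
    norm = integrate(mu_prev, g)
    Kf = apply_kernel(K, f)
    return [g[x] * Kf[x] / norm for x in range(len(g))]


# Illustrative data (assumed, not from the paper).
mu0 = [0.5, 0.3, 0.2]
gs = [[1.0, 2.0, 0.5], [0.7, 1.0, 3.0]]
Ks = [
    [[0.8, 0.1, 0.1], [0.2, 0.6, 0.2], [0.3, 0.3, 0.4]],
    [[0.5, 0.5, 0.0], [0.1, 0.8, 0.1], [0.0, 0.4, 0.6]],
]

# Build mu_1 and mu_2 from mu_0.
mus = [mu0]
for g, K in zip(gs, Ks):
    mus.append(next_measure(mus[-1], g, K))

f = [1.0, -2.0, 4.0]
q12_f = propagator_step(mus[1], gs[1], Ks[1], f)      # q_{1,2}(f)
q02_f = propagator_step(mus[0], gs[0], Ks[0], q12_f)  # q_{0,2}(f) = q_{0,1}(q_{1,2}(f))

lhs = integrate(mus[0], q02_f)  # mu_0(q_{0,2}(f))
rhs = integrate(mus[2], f)      # mu_2(f)
```

Up to floating-point error, `lhs` and `rhs` agree, which is the relation $\mu_j(q_{j,k}(f)) = \mu_k(f)$ for $j = 0$, $k = 2$.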
2.3. The Interacting Particle System. In the Sequential MCMC algorithm, we approximate
the measures $(\mu_k)_k$ by simulating the interacting particle system introduced in the
following. We start with $N$ independent samples $\xi_0 = (\xi_0^1, \ldots, \xi_0^N)$ from $\mu_0$. The particle
dynamics alternates two steps, Importance Sampling Resampling and Mutation: A vector of
particles $\xi_{k-1}$ approximating $\mu_{k-1}$ is transformed into a vector $\hat\xi_k$ approximating $\hat\mu_k$ by
drawing $N$ conditionally independent samples from the empirical distribution of $\xi_{k-1}$ weighted
with the functions $g_{k-1,k}$. Afterwards, $\hat\xi_k$ is transformed into a vector $\xi_k$ approximating $\mu_k$
by moving the particles independently with the transition kernel $K_k$.

We thus have two arrays of random variables $(\xi_k^j)_{0 \le k \le n,\, 1 \le j \le N}$ and $(\hat\xi_k^j)_{1 \le k \le n,\, 1 \le j \le N}$ where
$\xi_k^j$ and $\hat\xi_{k+1}^j$ take values in $E_k$. Denote respectively by $\mathbb{P}[\cdot]$ and $\mathbb{E}[\cdot]$ probabilities and expectations
taken with respect to the randomness in the particle system, i.e., with respect to the random
variables $(\xi_k^j)_{k,j}$ and $(\hat\xi_k^j)_{k,j}$. Denote by $\mathcal{F}_k$ the $\sigma$-algebra generated by $\xi_0, \ldots, \xi_k$ and $\hat\xi_1, \ldots, \hat\xi_k$
and by $\hat{\mathcal{F}}_k$ the $\sigma$-algebra generated by $\xi_0, \ldots, \xi_{k-1}$ and $\hat\xi_1, \ldots, \hat\xi_k$. Denote by $\eta_k^N$ the empirical
measure of $\xi_k$, i.e.,

\[ \eta_k^N = \frac{1}{N} \sum_{i=1}^N \delta_{\xi_k^i} . \]

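The algorithm proceeds as follows:

(i) Draw $\xi_0^1, \ldots, \xi_0^N$ independently from $\mu_0$.

(ii) For $k = 1, \ldots, n$,

(a) draw $\hat\xi_k = (\hat\xi_k^1, \ldots, \hat\xi_k^N)$ according to

\[ \mathbb{P}[\hat\xi_k \in dx \mid \mathcal{F}_{k-1}] = \prod_{j=1}^N \sum_{i=1}^N \frac{g_{k-1,k}(\xi_{k-1}^i)}{\sum_{l=1}^N g_{k-1,k}(\xi_{k-1}^l)}\, \delta_{\xi_{k-1}^i}(dx^j) , \]

(b) draw $\xi_k = (\xi_k^1, \ldots, \xi_k^N)$ according to

\[ \mathbb{P}[\xi_k \in dx \mid \hat{\mathcal{F}}_k] = \prod_{j=1}^N K_k(\hat\xi_k^j, dx^j) . \]

(iii) Approximate $\mu_n$ by

\[ \eta_n^N = \frac{1}{N} \sum_{i=1}^N \delta_{\xi_n^i} . \]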
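
Steps (i) to (iii) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation analysed in the paper: the function names (`sequential_mcmc`, `sample_mu0`, `log_g`, `mcmc_move`), the one-dimensional tempered Gaussian example and the random-walk Metropolis move are all hypothetical choices made here.

```python
import math
import random


def sequential_mcmc(sample_mu0, log_g, mcmc_move, n_levels, n_particles, rng):
    """Sequential MCMC with multinomial resampling (illustrative sketch).

    sample_mu0(rng)      -> one draw from mu_0                      (step (i))
    log_g(k, x)          -> log of the unnormalized weight g_{k-1,k}(x)
    mcmc_move(k, x, rng) -> one draw from the kernel K_k(x, .)
    Returns the final particles and the accumulated log of the mean
    unnormalized weights, an estimate of sum_k log mu_{k-1}(g_{k-1,k}).
    """
    particles = [sample_mu0(rng) for _ in range(n_particles)]
    log_mean_weights = 0.0
    for k in range(1, n_levels + 1):
        # Step (ii)(a): multinomial resampling proportional to g_{k-1,k}.
        log_w = [log_g(k, x) for x in particles]
        shift = max(log_w)  # subtract the maximum for numerical stability
        weights = [math.exp(lw - shift) for lw in log_w]
        log_mean_weights += shift + math.log(sum(weights) / n_particles)
        resampled = rng.choices(particles, weights=weights, k=n_particles)
        # Step (ii)(b): move every particle independently with K_k.
        particles = [mcmc_move(k, x, rng) for x in resampled]
    return particles, log_mean_weights


# Hypothetical example: mu_k proportional to exp(-beta_k * H) with H(x) = x^2 / 2,
# so that mu_k is the centred Gaussian with variance 1 / beta_k.
BETAS = [0.25, 0.5, 0.75, 1.0]


def hamiltonian(x):
    return 0.5 * x * x


def sample_mu0(rng):
    return rng.gauss(0.0, 1.0 / math.sqrt(BETAS[0]))


def log_g(k, x):
    return -(BETAS[k] - BETAS[k - 1]) * hamiltonian(x)


def mcmc_move(k, x, rng, steps=5, scale=1.0):
    # A few random-walk Metropolis steps with stationary distribution mu_k.
    for _ in range(steps):
        y = x + scale * rng.gauss(0.0, 1.0)
        accept_prob = math.exp(min(0.0, -BETAS[k] * (hamiltonian(y) - hamiltonian(x))))
        if rng.random() < accept_prob:
            x = y
    return x


rng = random.Random(0)
particles, log_mean_weights = sequential_mcmc(
    sample_mu0, log_g, mcmc_move, n_levels=3, n_particles=2000, rng=rng
)
second_moment = sum(x * x for x in particles) / len(particles)
```

In this toy example the final target is the standard Gaussian, so `second_moment` should be close to 1, and `log_mean_weights` should be close to $\log(1/2)$, the logarithm of the ratio of the normalizing constants of $\mu_3$ and $\mu_0$.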

In the following we will study how well $\eta_n^N$ approximates $\mu_n$. We are mainly interested
in the case where the state spaces $E_k$ are identical and where $K_k$ encompasses many steps
of an MCMC dynamics (e.g., Metropolis) with target $\mu_k$. In that case $g_{k-1,k}$ becomes an
unnormalized relative density between $\mu_{k-1}$ and $\mu_k$ and we have $\hat\mu_k = \mu_k$. We will restrict
attention to that setting from Section 4 on, but up to then this additional generality comes
basically for free. At that point we will also add assumptions ensuring that $\mu_{k-1}$ and $\mu_k$ are
sufficiently similar (namely, Assumption B).

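We end the preliminary observations with the following lemma:

Lemma 2.1. We have for $f \in B(E_k)$ and $1 \le k \le n$

\[ \mathbb{E}[\eta_k^N(f) \mid \mathcal{F}_{k-1}] = \frac{\eta_{k-1}^N(q_{k-1,k}(f))}{\eta_{k-1}^N(q_{k-1,k}(1))} \]

and for $1 \le j \le N$

\[ \mathbb{E}[f(\xi_k^j) \mid \mathcal{F}_{k-1}] = \frac{\eta_{k-1}^N(q_{k-1,k}(f))}{\eta_{k-1}^N(q_{k-1,k}(1))} . \]

Proof. Note that $\mathcal{F}_{k-1} \subset \hat{\mathcal{F}}_k \subset \mathcal{F}_k$. We can thus write

\begin{align*}
\mathbb{E}[f(\xi_k^j) \mid \mathcal{F}_{k-1}]
  &= \mathbb{E}[\mathbb{E}[f(\xi_k^j) \mid \hat{\mathcal{F}}_k] \mid \mathcal{F}_{k-1}] \\
  &= \mathbb{E}[K_k(\hat\xi_k^j, f) \mid \mathcal{F}_{k-1}] \\
  &= \frac{\sum_{i=1}^N g_{k-1,k}(\xi_{k-1}^i)\, K_k(\xi_{k-1}^i, f)}{\sum_{l=1}^N g_{k-1,k}(\xi_{k-1}^l)} \\
  &= \frac{\eta_{k-1}^N(g_{k-1,k} K_k(f))}{\eta_{k-1}^N(g_{k-1,k})} \\
  &= \frac{\eta_{k-1}^N(q_{k-1,k}(f))}{\eta_{k-1}^N(q_{k-1,k}(1))} .
\end{align*}

This proves the claim for $\mathbb{E}[f(\xi_k^j) \mid \mathcal{F}_{k-1}]$ and immediately implies the claim for $\mathbb{E}[\eta_k^N(f) \mid \mathcal{F}_{k-1}]$.

□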

2.4. Variances of Weighted Empirical Averages. We are interested in finding efficient
upper bounds for the quantities

\[ \mathbb{E}[|\eta_n^N(f) - \mu_n(f)|^2] \]

and

\[ \mathbb{E}[|\eta_n^N(f) - \mu_n(f)|] . \]

As is shown in Lemma 2.2 below, these quantities can be controlled in terms of the
approximation error of a weighted empirical measure $\nu_n^N$ which is easier to handle. In this section,
we introduce the measure $\nu_n^N$ and establish an explicit formula for its quadratic error

\[ \mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2] . \]

Define for $0 \le k \le n$

\[ \nu_k^N(f) = \varphi_k\, \eta_k^N(f) , \]

where $\varphi_k$ is given by

\[ \varphi_k = \prod_{j=0}^{k-1} \eta_j^N(q_{j,j+1}(1)) \]

for $1 \le k \le n$ and $\varphi_0 = 1$. Note that typically $\nu_k^N$ is not a probability distribution and that
$\varphi_k$ is $\mathcal{F}_{k-1}$-measurable. The factor $\varphi_k$ is chosen in a way that from Lemma 2.1 we have

\[ \mathbb{E}[\nu_k^N(f) \mid \mathcal{F}_{k-1}] = \nu_{k-1}^N(q_{k-1,k}(f)) . \tag{1} \]

By the unbiasedness proved in Proposition 2.3 below it also holds that

\[ \mathbb{E}[\varphi_k] = \mathbb{E}[\nu_k^N(1)] = \mu_k(1) = 1 . \]

Furthermore, the following relation will prove useful:

\[ \nu_{k+1}^N(1) = \varphi_{k+1} = \varphi_k\, \eta_k^N(q_{k,k+1}(1)) = \nu_k^N(q_{k,k+1}(1)) . \tag{2} \]
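
For the reader's convenience, the step behind (1) can be written out. The display below is a short derivation sketch which only uses Lemma 2.1, the $\mathcal{F}_{k-1}$-measurability of $\varphi_k$ and the recursion $\varphi_k = \varphi_{k-1}\, \eta_{k-1}^N(q_{k-1,k}(1))$:

```latex
\begin{align*}
\mathbb{E}[\nu_k^N(f) \mid \mathcal{F}_{k-1}]
  &= \varphi_k\, \mathbb{E}[\eta_k^N(f) \mid \mathcal{F}_{k-1}] \\
  &= \varphi_{k-1}\, \eta_{k-1}^N(q_{k-1,k}(1))\,
     \frac{\eta_{k-1}^N(q_{k-1,k}(f))}{\eta_{k-1}^N(q_{k-1,k}(1))} \\
  &= \varphi_{k-1}\, \eta_{k-1}^N(q_{k-1,k}(f))
   = \nu_{k-1}^N(q_{k-1,k}(f)) .
\end{align*}
```

The random normalization in Lemma 2.1 thus cancels exactly against the last factor of $\varphi_k$, which is why the weighted measure, unlike $\eta_k^N$ itself, has a linear one-step conditional expectation.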

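The connection between the approximation errors of $\eta_n^N(f)$ and $\nu_n^N(f)$ is established in the
following lemma:

Lemma 2.2. For $f \in B(E_n)$ define $f_n = f - \mu_n(f)$ and denote by $\|\cdot\|_{\sup,n}$ the supremum
norm on $B(E_n)$. Then we have the bounds

\[ \mathbb{E}[|\eta_n^N(f) - \mu_n(f)|^2] \le 2\, \mathrm{Var}(\nu_n^N(f_n)) + 2\, \|f_n\|_{\sup,n}^2\, \mathrm{Var}(\nu_n^N(1)) \tag{3} \]

and

\begin{align*}
\mathbb{E}[|\eta_n^N(f) - \mu_n(f)|]
  \le{}& \mathrm{Var}(\nu_n^N(f_n))^{\frac12} + \sqrt{2}\, \|f_n\|_{\sup,n}\, \mathrm{Var}(\nu_n^N(1)) \\
     & + \sqrt{2}\, \mathrm{Var}(\nu_n^N(f_n))^{\frac12}\, \mathrm{Var}(\nu_n^N(1))^{\frac12} . \tag{4}
\end{align*}

Proof of Lemma 2.2. Observe that for $a, b \in \mathbb{R}$ the fact that $(a - 2b)^2 \ge 0$ implies

\[ a^2 \le 2(a - b)^2 + 2 b^2 . \]

We can thus prove (3) as follows:

\begin{align*}
\mathbb{E}[\eta_n^N(f_n)^2]
  &\le 2\, \mathbb{E}[(\eta_n^N(f_n) - \nu_n^N(f_n))^2] + 2\, \mathbb{E}[\nu_n^N(f_n)^2] \\
  &\le 2\, \|f_n\|_{\sup,n}^2\, \mathrm{Var}(\nu_n^N(1)) + 2\, \mathrm{Var}(\nu_n^N(f_n)) ,
\end{align*}

where the last step uses the unbiasedness of $\nu_n^N$ proved in Proposition 2.3 below. To show (4),
observe that by the triangle inequality, by the definition of $\nu_n^N$ and by the Cauchy-Schwarz
inequality we have

\begin{align*}
\mathbb{E}[|\eta_n^N(f_n)|]
  &\le \mathbb{E}[|\eta_n^N(f_n) - \nu_n^N(1)\, \eta_n^N(f_n)|] + \mathbb{E}[|\nu_n^N(f_n)|] \\
  &\le \mathbb{E}[\eta_n^N(f_n)^2]^{\frac12}\, \mathbb{E}[|\nu_n^N(1) - 1|^2]^{\frac12} + \mathbb{E}[\nu_n^N(f_n)^2]^{\frac12} .
\end{align*}

Inserting (3) completes the proof. □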
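
The elementary inequality used at the start of the proof follows from a one-line expansion, written out here as a worked step:

```latex
\[
2(a-b)^2 + 2b^2 - a^2 \;=\; a^2 - 4ab + 4b^2 \;=\; (a - 2b)^2 \;\ge\; 0
\quad\Longrightarrow\quad
a^2 \;\le\; 2(a-b)^2 + 2b^2 .
\]
```

In the proof it is applied pointwise with $a = \eta_n^N(f_n)$ and $b = \nu_n^N(f_n)$ before taking expectations.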

Thus we can indeed control the approximation error of $\eta_n^N$ in terms of the approximation
error of $\nu_n^N$.

The main result of this section shows that $\nu_n^N(f)$ is an unbiased estimator for $\mu_n(f)$ and gives
an explicit expression for its variance which is well-suited for deriving our later error bounds.
These results are known in the literature, see, e.g., Del Moral and Miclo ([9], Section 2).
They are stated here (and proved below) for the sake of completeness and to present exactly
what we need in a rigorous way and in the form in which we need it.

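Proposition 2.3. For all $f \in B(E_n)$,

\[ \mathbb{E}[\nu_n^N(f)] = \mu_n(f) \]

and

\[ \mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2] = \frac{1}{N}\, \mathrm{Var}_{\mu_n}(f) + \frac{1}{N} \sum_{j=0}^{n-1} \mathbb{E}[V_{j,n}^N(f)] , \]

where

\[ V_{j,n}^N(f) = \nu_j^N(1)\, \nu_j^N(q_{j,n}(f)^2) - \nu_j^N(q_{j,n}(f))^2 + \nu_j^N(q_{j,j+1}(1) - 1)\, \nu_j^N(q_{j,n}(f^2)) . \tag{5} \]

The proof of the proposition is based on martingale methods and proceeds in a number of
lemmas which make up the remainder of this section. The actual proof of the proposition
follows at the end. Note first that for any fixed $f \in B(E_n)$ the process $(A_j)_{j=0}^n$ defined by

\[ A_j = \nu_j^N(q_{j,n}(f)) \]

is an $(\mathcal{F}_j)$-martingale by (1) and by the semigroup property of the mappings $q_{j,n}$. Recall that
by the Doob decomposition the process $H_j$ given by

\[ H_j = A_j^2 - A_0^2 - \sum_{k=0}^{j-1} \mathbb{E}[A_{k+1}^2 - A_k^2 \mid \mathcal{F}_k] \]

is a martingale. We next derive a more explicit expression for $H_j$.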
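Lemma 2.4. We have

\[ H_j = A_j^2 - A_0^2 - \frac{1}{N} \sum_{k=0}^{j-1} \Big( \nu_k^N(q_{k,k+1}(1))\, \nu_k^N(q_{k,k+1}(q_{k+1,n}(f)^2)) - \nu_k^N(q_{k,n}(f))^2 \Big) . \tag{6} \]

Proof. By the definitions of $A_{k+1}$, $\nu_{k+1}^N$ and $\eta_{k+1}^N$ and since the random variables $\xi_{k+1}^1, \ldots, \xi_{k+1}^N$
are conditionally (on $\mathcal{F}_k$) independent we can write

\begin{align*}
\mathbb{E}[A_{k+1}^2 \mid \mathcal{F}_k]
  ={}& \frac{\varphi_{k+1}^2}{N^2} \Big( N\, \mathbb{E}[q_{k+1,n}(f)(\xi_{k+1}^1)^2 \mid \mathcal{F}_k] - N\, \mathbb{E}[q_{k+1,n}(f)(\xi_{k+1}^1) \mid \mathcal{F}_k]^2 \Big) \\
     & + \frac{\varphi_{k+1}^2}{N^2}\, N^2\, \mathbb{E}[q_{k+1,n}(f)(\xi_{k+1}^1) \mid \mathcal{F}_k]^2 .
\end{align*}

Thus by Lemma 2.1 and the semigroup property of the $q_{j,k}$ we have

\[ \mathbb{E}[A_{k+1}^2 - A_k^2 \mid \mathcal{F}_k] = \frac{1}{N} \Big( \nu_k^N(q_{k,k+1}(1))\, \nu_k^N(q_{k,k+1}(q_{k+1,n}(f)^2)) - \nu_k^N(q_{k,n}(f))^2 \Big) , \]

where in the last step we used (2). Inserting the resulting expression for $\mathbb{E}[A_{k+1}^2 - A_k^2 \mid \mathcal{F}_k]$
into the definition of $H_j$ yields (6) and concludes the proof. □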

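We can use Lemma 2.4 to derive an explicit expression for $\mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2]$. In order to
make this expression more tractable, concretely, in order to remove the terms $q_{k,k+1}(q_{k+1,n}(f)^2)$,
we use the following lemma:

Lemma 2.5. For all $f \in B(E_n)$ the processes

\begin{align*}
L_k ={}& \nu_k^N(1)\, \nu_k^N(q_{k,n}(f)^2) - \nu_0^N(1)\, \nu_0^N(q_{0,n}(f)^2) \\
      & - \sum_{j=0}^{k-1} \nu_j^N(q_{j,j+1}(1))\, \nu_j^N(q_{j,j+1}(q_{j+1,n}(f)^2)) + \sum_{j=0}^{k-1} \nu_j^N(1)\, \nu_j^N(q_{j,n}(f)^2) \tag{7}
\end{align*}

and

\[ M_k = \nu_k^N(1)\, \nu_k^N(q_{k,n}(f^2)) - \nu_0^N(1)\, \nu_0^N(q_{0,n}(f^2)) - \sum_{j=0}^{k-1} \nu_j^N(q_{j,j+1}(1) - 1)\, \nu_j^N(q_{j,n}(f^2)) \tag{8} \]

are $(\mathcal{F}_k)$-martingales.

Proof. By the Doob decomposition, for $B_k = \nu_k^N(1)\, \nu_k^N(q_{k,n}(f)^2)$ the process

\[ L_k = B_k - B_0 - \sum_{j=0}^{k-1} \mathbb{E}[B_{j+1} \mid \mathcal{F}_j] + \sum_{j=0}^{k-1} B_j \]

is a martingale. To obtain the expression in (7) it is sufficient to note that by Lemma 2.1
and by (2)

\begin{align*}
\mathbb{E}[B_{j+1} \mid \mathcal{F}_j]
  &= \nu_{j+1}^N(1)^2\, \mathbb{E}[\eta_{j+1}^N(q_{j+1,n}(f)^2) \mid \mathcal{F}_j]
   = \nu_{j+1}^N(1)^2\, \frac{\eta_j^N(q_{j,j+1}(q_{j+1,n}(f)^2))}{\eta_j^N(q_{j,j+1}(1))} \\
  &= \nu_j^N(q_{j,j+1}(1))\, \nu_j^N(q_{j,j+1}(q_{j+1,n}(f)^2)) .
\end{align*}

Likewise for $B_k = \nu_k^N(1)\, \nu_k^N(q_{k,n}(f^2))$ the process

\[ M_k = B_k - B_0 - \sum_{j=0}^{k-1} \mathbb{E}[B_{j+1} - B_j \mid \mathcal{F}_j] \]

is a martingale. To obtain the expression in (8) note that by (1) and by (2)

\begin{align*}
\mathbb{E}[B_{j+1} - B_j \mid \mathcal{F}_j]
  &= \nu_{j+1}^N(1)\, \mathbb{E}[\nu_{j+1}^N(q_{j+1,n}(f^2)) \mid \mathcal{F}_j] - \nu_j^N(1)\, \nu_j^N(q_{j,n}(f^2)) \\
  &= \nu_j^N(q_{j,j+1}(1) - 1)\, \nu_j^N(q_{j,n}(f^2)) .
\end{align*}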

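With these lemmas, we have established the tools needed to prove Proposition 2.3.

Proof of Proposition 2.3. For the unbiasedness, note that

\begin{align*}
\nu_n^N(f) - \mu_n(f)
  &= \nu_n^N(q_{n,n}(f)) - \nu_0^N(q_{0,n}(f)) + \nu_0^N(q_{0,n}(f)) - \mu_0(q_{0,n}(f)) \\
  &= A_n - A_0 + \nu_0^N(q_{0,n}(f)) - \mu_0(q_{0,n}(f)) .
\end{align*}

This implies

\[ \mathbb{E}[\nu_n^N(f)] = \mu_n(f) \]

since the martingale property of $A_n$ implies $\mathbb{E}[A_n - A_0] = 0$, and since we have
$\mathbb{E}[\nu_0^N(q_{0,n}(f))] = \mu_0(q_{0,n}(f))$ because of $\nu_0^N = \eta_0^N$ and because the particles $\xi_0^i$ are
independent samples from $\mu_0$.

Now note that by the martingale property of $A_j$

\begin{align*}
\mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2]
  &= \mathbb{E}[|(A_n - A_0) + (\nu_0^N(q_{0,n}(f)) - \mu_0(q_{0,n}(f)))|^2] \\
  &= \mathbb{E}[A_n^2 - A_0^2] + \mathbb{E}[|\nu_0^N(q_{0,n}(f)) - \mu_0(q_{0,n}(f))|^2] . \tag{9}
\end{align*}

Note that the second summand equals $\frac{1}{N} \mathrm{Var}_{\mu_0}(q_{0,n}(f))$. Using Lemma 2.4 and, in the second
step, (7) we can thus write

\begin{align*}
\mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2]
  ={}& \frac{1}{N}\, \mathrm{Var}_{\mu_0}(q_{0,n}(f)) \\
     & + \frac{1}{N} \sum_{k=0}^{n-1} \mathbb{E}\Big[ \nu_k^N(q_{k,k+1}(1))\, \nu_k^N(q_{k,k+1}(q_{k+1,n}(f)^2)) - \nu_k^N(q_{k,n}(f))^2 \Big] \\
  ={}& \frac{1}{N} \Big( \mathrm{Var}_{\mu_0}(q_{0,n}(f)) + \mathbb{E}\big[ \nu_n^N(1)\, \nu_n^N(f^2) - \nu_0^N(1)\, \nu_0^N(q_{0,n}(f)^2) \big] \Big) \\
     & + \frac{1}{N} \sum_{k=0}^{n-1} \mathbb{E}\Big[ \nu_k^N(1)\, \nu_k^N(q_{k,n}(f)^2) - \nu_k^N(q_{k,n}(f))^2 \Big] . \tag{10}
\end{align*}

Now observe that

\[ \mathrm{Var}_{\mu_0}(q_{0,n}(f)) - \mathbb{E}[\nu_0^N(1)\, \nu_0^N(q_{0,n}(f)^2)] = \mathrm{Var}_{\mu_0}(q_{0,n}(f)) - \mu_0(q_{0,n}(f)^2) = -\mu_n(f)^2 . \tag{11} \]

Moreover, by (8) we have

\begin{align*}
\mathbb{E}[\nu_n^N(1)\, \nu_n^N(f^2)] - \mu_n(f)^2
  ={}& \mathbb{E}[\nu_0^N(1)\, \nu_0^N(q_{0,n}(f^2))] - \mu_n(f)^2 \\
     & + \sum_{j=0}^{n-1} \mathbb{E}[\nu_j^N(q_{j,j+1}(1) - 1)\, \nu_j^N(q_{j,n}(f^2))] \\
  ={}& \mathrm{Var}_{\mu_n}(f) + \sum_{j=0}^{n-1} \mathbb{E}[\nu_j^N(q_{j,j+1}(1) - 1)\, \nu_j^N(q_{j,n}(f^2))] . \tag{12}
\end{align*}

Inserting (11) and then (12) into (10) yields

\[ \mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2] = \frac{1}{N}\, \mathrm{Var}_{\mu_n}(f) + \frac{1}{N} \sum_{j=0}^{n-1} \mathbb{E}[V_{j,n}^N(f)] \]

with

\[ V_{j,n}^N(f) = \nu_j^N(1)\, \nu_j^N(q_{j,n}(f)^2) - \nu_j^N(q_{j,n}(f))^2 + \nu_j^N(q_{j,j+1}(1) - 1)\, \nu_j^N(q_{j,n}(f^2)) \]

so we are done.

□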



Throughout, we assume the integrands $f$ to be bounded. This assumption becomes crucial
when transferring results from the weighted particle measure to the original one in Lemma
2.2. For this reason, we restrict attention to bounded integrands and avoid introducing
systems of integrability conditions (as is done, e.g., in Chapter 9.2 of Cappé, Moulines and
Rydén [2]). However, the results on the weighted particle measures do not rely on this
boundedness and could be generalized along such lines.

3. Non-asymptotic Error Bounds

In this section we derive non-asymptotic error bounds from our expression for the variance
of $\nu_n^N(f)$ derived in the previous section. For $0 \le j \le n$, let $\|\cdot\|_j$ be a norm on the function
space $B(E_j)$ such that $\|f\|_j < \infty$ for all $f \in B(E_j)$. We obtain our bounds from the following
assumption which arises naturally in the proofs. In the later sections, we show how to verify
the assumption from more elementary conditions in more specific settings.

Assumption A. For all $0 \le j < k \le n$, there exists a constant $c_{j,k}$ such that for all
$f \in B(E_k)$, the following stability inequality for the propagator $q_{j,k}$ is satisfied:

\[ \max\big( \|1\|_j\, \|q_{j,k}(f)^2\|_j ,\; \|q_{j,k}(f)\|_j^2 ,\; \|q_{j,k}(f^2)\|_j \big) \le c_{j,k}\, \|f\|_k^2 . \tag{13} \]

In the following we show that the quadratic approximation error of $\nu_n^N$ can essentially be
controlled through the constants $c_{j,k}$.

Recall that the approximation error was given by the expected sum over $j$ of the expressions
$V_{j,n}^N$ defined in the previous section. We first show how $V_{j,n}^N$ can be bounded through $V_{j,n}$
defined by

\[ V_{j,n}(f) = \mathrm{Var}_{\mu_j}(q_{j,n}(f)) \]

and an error term. Note that $V_{j,n}$ is what we obtain when substituting $\nu_j^N$ by $\mu_j$ in our
expression for $V_{j,n}^N$. Define

\[ \varepsilon_j^N = \sup\{ \mathbb{E}[|\nu_j^N(f) - \mu_j(f)|^2] \;\big|\; \|f\|_j \le 1 \} . \]

Then we have the following result:



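Proposition 3.1. For $0 \le j < n$ we have

\[ \mathbb{E}[V_{j,n}^N(f)] \le V_{j,n}(f) + c_{j,n}\, \|f\|_n^2\, \big( 2 + \|q_{j,j+1}(1) - 1\|_j \big)\, \varepsilon_j^N . \]

Proof. Note first that by the Cauchy-Schwarz inequality and since $\nu_j^N(\cdot)$ is an unbiased
estimator for $\mu_j(\cdot)$, we have for any $g, h \in B(E_j)$

\begin{align*}
|\mathbb{E}[\nu_j^N(g)\, \nu_j^N(h)] - \mu_j(g)\, \mu_j(h)|
  \le{}& \big| \mu_j(g)\, \mathbb{E}[\nu_j^N(h) - \mu_j(h)] + \mu_j(h)\, \mathbb{E}[\nu_j^N(g) - \mu_j(g)] \big| \\
      & + \mathbb{E}[|\nu_j^N(g) - \mu_j(g)|\, |\nu_j^N(h) - \mu_j(h)|] \\
  \le{}& \|g\|_j\, \|h\|_j\, \varepsilon_j^N . \tag{14}
\end{align*}

Adding $\pm V_{j,n}(f)$ to the definition (5) of $V_{j,n}^N(f)$ and applying (14) three times yields

\[ \mathbb{E}[V_{j,n}^N(f)] \le V_{j,n}(f) + R_{j,n}(f)\, \varepsilon_j^N \]

with

\[ R_{j,n}(f) = \|1\|_j\, \|q_{j,n}(f)^2\|_j + \|q_{j,n}(f)\|_j^2 + \|q_{j,n}(f^2)\|_j\, \|q_{j,j+1}(1) - 1\|_j . \]

Applying (13) yields

\[ R_{j,n}(f) \le c_{j,n}\, \|f\|_n^2\, \big( 2 + \|q_{j,j+1}(1) - 1\|_j \big) \]

and thus the desired inequality. □

In order to state the main result of this section, we need a few more definitions. Define $\hat c_k$
and $\hat v_k$ by

\[ \hat c_k = \sum_{j=0}^{k-1} c_{j,k}\, \big( 2 + \|q_{j,j+1}(1) - 1\|_j \big) \]

and

\[ \hat v_k = \sup\Big\{ \sum_{j=0}^{k} \mathrm{Var}_{\mu_j}(q_{j,k}(f)) \;\Big|\; \|f\|_k \le 1 \Big\} . \]

Furthermore define

\[ \bar c_k = \max_{j \le k} \hat c_j , \quad \bar v_k = \max_{j \le k} \hat v_j \quad \text{and} \quad \bar\varepsilon_k^N = \max_{j \le k} \varepsilon_j^N . \]

Then we have the following bound on the approximation error: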



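Theorem 3.2. For $f \in B(E_n)$ we have

\[ N\, \mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2] \le \sum_{j=0}^{n} \mathrm{Var}_{\mu_j}(q_{j,n}(f)) + \|f\|_n^2\, \hat c_n\, \bar\varepsilon_n^N . \tag{15} \]

Moreover, if $N \ge 2 \bar c_n$ we have

\[ \bar\varepsilon_n^N \le \frac{2\, \bar v_n}{N} . \tag{16} \]

Proof of Theorem 3.2. Note that by Propositions 2.3 and 3.1 and by the definition of $V_{j,n}(f)$
we get

\[ N\, \mathbb{E}[|\nu_n^N(f) - \mu_n(f)|^2] \le \sum_{j=0}^{n} \mathrm{Var}_{\mu_j}(q_{j,n}(f)) + \sum_{j=0}^{n-1} c_{j,n}\, \big( 2 + \|q_{j,j+1}(1) - 1\|_j \big)\, \|f\|_n^2\, \varepsilon_j^N . \]

Bounding $\varepsilon_j^N$ by $\bar\varepsilon_n^N$ and inserting the definition of $\hat c_n$ shows (15). Optimizing (15) over $f$
with $\|f\|_n \le 1$ and over $n$ yields

\[ N\, \bar\varepsilon_n^N \le \bar v_n + \bar c_n\, \bar\varepsilon_n^N . \]

Choosing $N \ge 2 \bar c_n$ and thus $N - \bar c_n \ge \frac{N}{2}$ gives (16). □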
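
The last step of the proof is a short rearrangement. Writing $\varepsilon$, $v$ and $c$ for the three quantities in the self-referential inequality $N \varepsilon \le v + c\, \varepsilon$ (shorthand used here only for illustration), one obtains for $N \ge 2c$:

```latex
\[
(N - c)\, \varepsilon \;\le\; v
\quad\Longrightarrow\quad
\varepsilon \;\le\; \frac{v}{N - c} \;\le\; \frac{v}{N/2} \;=\; \frac{2v}{N} ,
\]
```

where the second inequality uses $N - c \ge N/2$. The condition $N \ge 2c$ is thus what turns the implicit bound into an explicit rate of order $1/N$.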



4. $L_p$-Stability

In this section, we show how $L_p$-stability of the Feynman-Kac propagators $q_{j,k}$ can be derived
from global mixing properties of the MCMC kernels $K_k$. We now consider a more restricted
setting where the measures $\mu_k$ live on a common state space and where the transition kernels
$K_k$ are stationary with respect to the measures $\mu_k$.

4.1. The setting. We now add the assumption that all the distributions $\mu_k$ live on the same
state space $E_k = E$. Moreover, we assume stationarity of the kernels $K_k$: For $1 \le k \le n$, let
$K_k(x, A)$ be an integral operator with $K_k(\cdot, f) \in B(E)$ for all $f \in B(E)$, with $K_k(x, \cdot) \in M_1(E)$
for all $x \in E$ and with stationary distribution $\mu_k$, i.e.,

\[ \mu_k K_k(A) = \mu_k(A) \quad \text{for all } A \in \mathcal{B}(E) . \]

Thus, $K_k$ can be thought of, e.g., as many steps of a Metropolis chain with target
$\mu_k$. Accordingly, the $\mu_k$ are related through

\[ \mu_k(f) = \frac{\mu_{k-1}(g_{k-1,k} f)}{\mu_{k-1}(g_{k-1,k})} \]

so that the functions $g_{k-1,k} \in B(E)$ are (unnormalized) relative densities between $\mu_{k-1}$ and
$\mu_k$. In the notation of Section 2.2 this corresponds to $\hat\mu_k = \mu_k$ for all $k$.

We now want to apply the error bound of Theorem 3.2. We thus need to define a series of
norms $\|\cdot\|_j$ on $B(E)$ and find constants $c_{j,k}$ such that the inequality

\[ \max\big( \|q_{j,k}(f)\|_j^2 ,\; \|1\|_j\, \|q_{j,k}(f)^2\|_j ,\; \|q_{j,k}(f^2)\|_j \big) \le c_{j,k}\, \|f\|_k^2 \tag{17} \]

from Assumption A is satisfied for all $f \in B(E)$.

We fix $p \ge 2$ and choose $\|\cdot\|_j = \|\cdot\|_{L_p(\mu_j)}$, i.e.,

\[ \|f\|_j = \|f\|_{L_p(\mu_j)} = \mu_j(|f|^p)^{\frac{1}{p}} . \]

Note that all bounded measurable functions are in $L_p(\mu_j)$ and that for $f \in B(E)$ we have
$q_{j,k}(f) \in B(E)$ since we assumed the relative densities $g_{k-1,k}$ to be bounded. Now define
$c_{j,k}(p, q)$ to be the constant in an $L_p$-$L_q$-bound for $q_{j,k}$: For $p \ge q \ge 1$ and $0 \le j < k \le n$ we
have

\[ \|q_{j,k}(f)\|_{L_p(\mu_j)} \le c_{j,k}(p, q)\, \|f\|_{L_q(\mu_k)} \quad \text{for all } f \in B(E) . \]

Such constants will be studied in Section 4.2 below. The next lemma shows how the constants
$c_{j,k}(p, q)$ can be used to verify a version of Assumption A.

Lemma 4.1. Define for some $p \ge 2$ and all $0 \le j < k \le n$

\[ c_{j,k}(p) = \max\Big( c_{j,k}\big(p, \tfrac{p}{2}\big) ,\; c_{j,k}(2p, p)^2 \Big) . \]

Then (17) is satisfied with $\|\cdot\|_j = \|\cdot\|_{L_p(\mu_j)}$ and with constants $c_{j,k}(p)$, i.e., for all $f \in B(E)$
we have

\[ \max\big( \|1\|_{L_p(\mu_j)}\, \|q_{j,k}(f)^2\|_{L_p(\mu_j)} ,\; \|q_{j,k}(f)\|_{L_p(\mu_j)}^2 ,\; \|q_{j,k}(f^2)\|_{L_p(\mu_j)} \big) \le c_{j,k}(p)\, \|f\|_{L_p(\mu_k)}^2 . \]

Proof. Observe first that we have

\[ \|q_{j,k}(f^2)\|_{L_p(\mu_j)} \le c_{j,k}\big(p, \tfrac{p}{2}\big)\, \|f^2\|_{L_{p/2}(\mu_k)} = c_{j,k}\big(p, \tfrac{p}{2}\big)\, \|f\|_{L_p(\mu_k)}^2 . \]

Furthermore we have $\|1\|_{L_p(\mu_j)} = 1$ and

\[ \|q_{j,k}(f)^2\|_{L_p(\mu_j)} = \|q_{j,k}(f)\|_{L_{2p}(\mu_j)}^2 \le c_{j,k}(2p, p)^2\, \|f\|_{L_p(\mu_k)}^2 . \]

Finally, observing that

\[ \|q_{j,k}(f)\|_{L_p(\mu_j)}^2 \le \|q_{j,k}(f)\|_{L_{2p}(\mu_j)}^2 \]

concludes the proof. □

4.2. $L_p$-Stability of Feynman-Kac Propagators. In this section we show how to derive
inequalities of the type

\[ \|q_{j,k}(f)\|_{L_p(\mu_j)} \le c_{j,k}(p, q)\, \|f\|_{L_q(\mu_k)} \quad \text{for } p \ge q,\; j < k \tag{18} \]

from suitable mixing conditions on the kernels $K_j, \ldots, K_k$. These are exactly the inequalities
we need in order to make the error bounds of the previous section explicit. The constants $c_{j,k}$
we derive are independent of the length of the time interval $k - j$. The central intermediate
step is Theorem 4.5 which derives (18) for the case $p = q = 2^r$, $r \in \mathbb{N}$, and for a modified
propagator $\hat q_{j,k}$ from an $L_2$-mixing condition for the kernels $K_j, \ldots, K_k$. The proof of Theorem
4.5 proceeds in a number of lemmas starting with only one step, $k - j = 1$, and $p = 2$ and then
gradually generalizing the result by showing how to proceed from $L_p$-stability to $L_{2p}$-stability
and to more than one step, $k - j > 1$. Theorem 4.5 is followed by a number of corollaries,
showing how to transfer the result to the original propagator $q_{j,k}$, and, using an additional
assumption of hyperboundedness, to the case $p > q$. Corollary 4.9 collects these observations
and states the resulting version of inequality (18). We close the section with an additional
result, Proposition 4.10, which shows that for functions $f$ with $\mu_k(f) = 0$ we can obtain a
version of inequality (18) where the constants $c_{j,k}(p, q)$ decay exponentially in the length of
the time interval $k - j$.

Throughout this section we assume a uniform upper bound $\gamma$ on the normalized relative
densities $\bar g_{k-1,k}$ given by

\[ \bar g_{k-1,k} = \frac{g_{k-1,k}}{\mu_{k-1}(g_{k-1,k})} . \]

Assumption B. There exists $\gamma > 1$ such that

\[ \bar g_{k-1,k}(x) \le \gamma \tag{19} \]

for all $x \in E$ and all $k$ with $1 \le k \le n$.

$\gamma$ is a rough measure of how strongly the measures $\mu_k$ differ from each other. For the analysis
of this section it proves to be convenient not to work with $q_{j,k}$ directly but to work with $\hat q_{j,k}$
defined as follows: For $1 < k \le n$ define $\hat q_{k-1,k} : B(E) \to B(E)$ by

\[ \hat q_{k-1,k}(f) = K_{k-1}(\bar g_{k-1,k} f) . \]

Furthermore, define for $1 \le j < k \le n$ the mapping $\hat q_{j,k} : B(E) \to B(E)$ by

\[ \hat q_{j,k}(f) = \hat q_{j,j+1}(\hat q_{j+1,j+2}(\cdots \hat q_{k-1,k}(f))) \quad \text{and} \quad \hat q_{k,k}(f) = f . \]

By definition, $\hat q_{j,k}$ is a semigroup. $q_{j,k}$ and $\hat q_{j+1,k}$ are related through

\[ q_{j,k}(f) = \bar g_{j,j+1}\, \hat q_{j+1,k}(K_k(f)) . \]

Stability results for $\hat q_{j,k}$ can be transferred to $q_{j,k}$ using a lemma which is stated and proved
later in this section.

$L_1$-stability of $\hat q_{j,k}$ simply follows from

\[ \|\hat q_{j,k}(f)\|_{L_1(\mu_j)} \le \mu_j(\hat q_{j,k}(|f|)) = \mu_k(|f|) = \|f\|_{L_1(\mu_k)} . \tag{20} \]

From (19) and from the fact that $\mu_k$ is a stationary distribution for $K_k$ it is easy to conclude
bounds such as

\[ \|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \gamma^{k-j}\, \|f\|_{L_p(\mu_k)} . \]

This bound has the major disadvantage that it degenerates exponentially in $k - j$ since $\gamma > 1$.
In the following, we assume and exploit mixing properties of the kernels in order to obtain
$L_p$-bounds for $p > 1$ which do not degenerate in $k - j$.

Throughout this section, we assume the following mixing condition:

Assumption C. There are constants $\alpha \in (0, 1)$ and $\beta \in [0, 1]$ such that for all $f \in B(E)$ we
have the following $L_2$-bound for $\hat q_{k-1,k}$:

\[ \|\hat q_{k-1,k}(f)\|_{L_2(\mu_{k-1})}^2 \le \alpha\, \|f\|_{L_2(\mu_k)}^2 + \beta\, \mu_k(f)^2 . \tag{21} \]

Additionally, our results will impose conditions that $\alpha$ is sufficiently small. In Section 5.1
below it is shown that one way to ensure that (21) holds with a sufficiently small $\alpha$ is to
assume that the kernels $K_k$ satisfy Poincaré inequalities associated with a sufficiently large
spectral gap.

Our first step is to iterate (21) in order to obtain $L_2$-bounds for $\hat q_{j,k}$.

Lemma 4.2. For $1 \le j < k \le n$ and $f \in B(E)$ we have the bounds

\[ \|\hat q_{j,k}(f)\|_{L_2(\mu_j)}^2 \le \alpha^{k-j}\, \|f\|_{L_2(\mu_k)}^2 + \frac{\beta}{1 - \alpha}\, \mu_k(f)^2 \tag{22} \]

and

\[ \|\hat q_{j,k}(f)\|_{L_2(\mu_j)} \le \frac{1}{(1 - \alpha)^{\frac12}}\, \|f\|_{L_2(\mu_k)} . \tag{23} \]

Proof. Iterating the bound (21) and utilizing that $\mu_j(\hat q_{j,k}(f)) = \mu_k(f)$ we get

\[ \|\hat q_{j,k}(f)\|_{L_2(\mu_j)}^2 \le \alpha^{k-j}\, \|f\|_{L_2(\mu_k)}^2 + \sum_{i=0}^{k-j-1} \beta\, \alpha^i\, \mu_k(f)^2 . \]

Applying to this the geometric series inequality immediately implies (22). Furthermore, since
we assumed $\beta \le 1$ and $\mu_k(f)^2 \le \|f\|_{L_2(\mu_k)}^2$ we have

\[ \|\hat q_{j,k}(f)\|_{L_2(\mu_j)}^2 \le \sum_{i=0}^{k-j} \alpha^i\, \|f\|_{L_2(\mu_k)}^2 . \]

Applying again the geometric series inequality yields (23). □
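
The iteration behind Lemma 4.2 is a scalar recursion and can be checked numerically. The Python sketch below is only an illustration: the function name `iterate_l2_bound` and the numerical values are assumptions made here. It iterates $a \mapsto \alpha a + \beta m$, where $a$ stands for a squared $L_2$-norm and $m$ for $\mu_k(f)^2$, and compares the result with the two closed-form bounds $\alpha^{s} a_0 + \frac{\beta}{1-\alpha} m$ and $\frac{a_0}{1-\alpha}$.

```python
def iterate_l2_bound(alpha, beta, steps, norm_sq, mean_sq):
    """Iterate a <- alpha * a + beta * mean_sq, starting from a = norm_sq.

    norm_sq plays the role of ||f||^2 in L_2(mu_k) and mean_sq the role of
    mu_k(f)^2; the result is the bound obtained after `steps` applications
    of the one-step L_2 inequality.
    """
    a = norm_sq
    for _ in range(steps):
        a = alpha * a + beta * mean_sq
    return a


# Illustrative constants (assumed): alpha in (0, 1), beta in [0, 1], mean_sq <= norm_sq.
alpha, beta = 0.6, 1.0
norm_sq, mean_sq = 1.0, 0.8

checks = []
for steps in (1, 5, 50):
    iterated = iterate_l2_bound(alpha, beta, steps, norm_sq, mean_sq)
    # Bound with the explicit mean term (geometric series summed to infinity).
    bound_with_mean = alpha ** steps * norm_sq + beta / (1.0 - alpha) * mean_sq
    # Cruder bound that no longer depends on the number of steps.
    bound_uniform = norm_sq / (1.0 - alpha)
    checks.append((steps, iterated, bound_with_mean, bound_uniform))
```

For every number of steps in `checks`, the iterated value stays below both closed-form bounds; in particular the second bound does not grow with the number of steps, which is the point of the lemma.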

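We now turn to $L_p$-bounds for the case of $p = 2^r$ with $r \in \mathbb{N}$. We proceed inductively,
deducing the bound for $p = 2^r$ from the bound for $p = 2^{r-1}$. We begin by deriving an
$L_{2p}$-bound for $\hat q_{k-1,k}$ from (21).

Lemma 4.3. For $1 < k \le n$, $f \in B(E)$ and $p \ge 1$ we have

\[ \|\hat q_{k-1,k}(f)\|_{L_{2p}(\mu_{k-1})}^{2p} \le \alpha\, \gamma^{2p-2}\, \|f\|_{L_{2p}(\mu_k)}^{2p} + \beta\, \gamma^{2p-2}\, \|f\|_{L_p(\mu_k)}^{2p} . \]

Proof. Note that we have

\begin{align*}
\|\hat q_{k-1,k}(f)\|_{L_{2p}(\mu_{k-1})}^{2p}
  &= \mu_{k-1}\big( |K_{k-1}(\bar g_{k-1,k} f)|^{2p} \big) \\
  &\le \mu_{k-1}\big( K_{k-1}(\bar g_{k-1,k}^{\,p} |f|^p)^2 \big) \\
  &= \|\hat q_{k-1,k}(\bar g_{k-1,k}^{\,p-1} |f|^p)\|_{L_2(\mu_{k-1})}^2 .
\end{align*}

Applying now the bound (21) yields

\begin{align*}
\|\hat q_{k-1,k}(f)\|_{L_{2p}(\mu_{k-1})}^{2p}
  &\le \alpha\, \|\bar g_{k-1,k}^{\,p-1} |f|^p\|_{L_2(\mu_k)}^2 + \beta\, \mu_k\big( \bar g_{k-1,k}^{\,p-1} |f|^p \big)^2 \\
  &\le \alpha\, \gamma^{2p-2}\, \|f\|_{L_{2p}(\mu_k)}^{2p} + \beta\, \gamma^{2p-2}\, \|f\|_{L_p(\mu_k)}^{2p} .
\end{align*}

□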

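Our next step is to show how by applying Lemma 4.3 we get an $L_{2p}$-bound for $\hat q_{j,k}$ from an
$L_p$-bound.

Lemma 4.4. Assume that $\alpha \gamma^{2p-2} < 1$ and that for $\delta(p) \ge 1$ we have for $1 \le j < k \le n$ and
$f \in B(E)$ the inequality

\[ \|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \delta(p)\, \|f\|_{L_p(\mu_k)} . \]

Then we have

\[ \|\hat q_{j,k}(f)\|_{L_{2p}(\mu_j)} \le \delta(2p)\, \|f\|_{L_{2p}(\mu_k)} \]

with

\[ \delta(2p) = \delta(p)\, \frac{\gamma^{1 - \frac{1}{p}}}{(1 - \alpha \gamma^{2p-2})^{\frac{1}{2p}}} . \]

Proof. Define $\delta = \alpha \gamma^{2p-2}$. Iterating the inequality of Lemma 4.3 and utilizing that $\beta \le 1$,
we get

\[ \|\hat q_{j,k}(f)\|_{L_{2p}(\mu_j)}^{2p} \le \delta^{k-j}\, \|f\|_{L_{2p}(\mu_k)}^{2p} + \gamma^{2p-2} \sum_{i=j+1}^{k} \delta^{i-j-1}\, \|\hat q_{i,k}(f)\|_{L_p(\mu_i)}^{2p} . \]

Using our assumption on $\|\hat q_{j,k}\|_{L_p(\mu_j)}$ and the facts that $\|f\|_{L_p(\mu_k)} \le \|f\|_{L_{2p}(\mu_k)}$, $\gamma > 1$ and
$\delta(p) \ge 1$, we get that

\[ \|\hat q_{j,k}(f)\|_{L_{2p}(\mu_j)}^{2p} \le \|f\|_{L_{2p}(\mu_k)}^{2p}\, \gamma^{2p-2}\, \delta(p)^{2p} \sum_{i=0}^{k-j} \delta^i . \]

Thus, since we assumed $\delta < 1$, by the geometric series inequality we have

\[ \|\hat q_{j,k}(f)\|_{L_{2p}(\mu_j)} \le \delta(2p)\, \|f\|_{L_{2p}(\mu_k)} \]

with

\[ \delta(2p) = \delta(p)\, \frac{\gamma^{1 - \frac{1}{p}}}{(1 - \alpha \gamma^{2p-2})^{\frac{1}{2p}}} . \]

□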

Combining Lemmas 4.2 and 4.4 we can state the key result of this section as follows: 

Theorem 4.5. For $r \in \mathbb{N}$, consider $p = 2^r$ and assume that $\alpha\gamma^{p-2} < 1$. Then we have for 
$1 \le j < k \le n$ and $f \in B(E)$ the inequality 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \delta(p) \|f\|_{L_p(\mu_k)}
\]
with 
\[
\delta(p) = \prod_{i=1}^{r} \frac{\gamma^{1-2^{-(i-1)}}}{(1 - \alpha\gamma^{2^i-2})^{2^{-i}}} \le \frac{\gamma^{r-2+2^{-(r-1)}}}{1 - \alpha\gamma^{2^r-2}}.
\]

Proof. The case $r = 0$ follows from (20). In the case $r = 1$, the inequality coincides with 
(23). The inequalities for $r > 1$ follow because Lemma 4.4 implies that we can choose 
\[
\delta(2^r) = \delta(2) \prod_{i=2}^{r} \frac{\gamma^{1-2^{-(i-1)}}}{(1 - \alpha\gamma^{2^i-2})^{2^{-i}}}.
\]
We can apply Lemma 4.4 iteratively, since $\alpha\gamma^{p-2} < 1$ implies $\alpha\gamma^{q-2} < 1$ for all $q < p$. For 
the upper bound on $\delta(p)$, we apply the geometric series equality in the numerator, bound 
the term in brackets under the exponent in the denominator from below by $1 - \alpha\gamma^{p-2}$ and apply the 
geometric series inequality to the product. □ 
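The constants of Theorem 4.5 are easy to evaluate. The sketch below assumes the recursion $\delta(2p) = \delta(p)\,\gamma^{1-1/p} (1-\alpha\gamma^{2p-2})^{-1/(2p)}$ started from $\delta(1) = 1$, the product form $\prod_{i=1}^r \gamma^{1-2^{-(i-1)}} (1-\alpha\gamma^{2^i-2})^{-2^{-i}}$, and the upper bound $\gamma^{r-2+2^{-(r-1)}}/(1-\alpha\gamma^{2^r-2})$; all function names are illustrative choices of ours.

```python
def delta_recursive(alpha, gamma, r):
    """delta(2**r) by iterating the doubling step of Lemma 4.4, starting at delta(1) = 1."""
    delta, p = 1.0, 1
    for _ in range(r):
        theta = alpha * gamma ** (2 * p - 2)
        if theta >= 1.0:
            raise ValueError("need alpha * gamma**(2p - 2) < 1")
        delta *= gamma ** (1.0 - 1.0 / p) / (1.0 - theta) ** (1.0 / (2 * p))
        p *= 2
    return delta


def delta_product(alpha, gamma, r):
    """delta(2**r) written as an explicit product over i = 1, ..., r."""
    out = 1.0
    for i in range(1, r + 1):
        numerator = gamma ** (1.0 - 2.0 ** (-(i - 1)))
        denominator = (1.0 - alpha * gamma ** (2 ** i - 2)) ** (2.0 ** (-i))
        out *= numerator / denominator
    return out


def delta_upper(alpha, gamma, r):
    """Closed-form upper bound on delta(2**r)."""
    return gamma ** (r - 2 + 2.0 ** (-(r - 1))) / (1.0 - alpha * gamma ** (2 ** r - 2))
```

For example, with $\alpha = 0.05$, $\gamma = 1.5$ and $r = 3$ the condition $\alpha\gamma^{6} < 1$ holds, the two evaluations of $\delta(8)$ agree, and both lie below the closed-form bound.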

Since the constants $\delta(2^r)$ are monotonically increasing in $r$, we can immediately extend the 
bounds of Theorem 4.5 to general $p \ge 1$ using the Riesz-Thorin interpolation theorem (see 
Davies [5], §1.1.5): 

Corollary 4.6. Consider $p \in [2^r, 2^{r+1}]$ for $r \in \mathbb{N}$ and assume $\alpha\gamma^{2^{r+1}-2} < 1$. Then for 
$1 \le j < k \le n$ and $f \in B(E)$ we have 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \delta(p) \|f\|_{L_p(\mu_k)}
\]
with $\delta(p)$ given by 
\[
\delta(p) = \delta(2^{r+1}),
\]
where $\delta(2^{r+1})$ is defined as in Theorem 4.5. 

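Since we need $L_{2p}$-$L_p$-bounds in the error bounds of Section 3, we now show that, given 
an $L_p$-$L_q$-bound for $K_k$, we can immediately conclude from Corollary 4.6 an $L_p$-$L_q$-bound 
for $\hat q_{j,k}$. 

Corollary 4.7. Consider $p \ge 1$ and $q \ge 1$. Let $q \in [2^r, 2^{r+1}]$ for $r \in \mathbb{N}$ and assume 
$\alpha\gamma^{2^{r+1}-2} < 1$. Assume that for $1 \le j \le n$ we have a constant $\theta_j(p,q) > 0$ such that 
\[
\|K_j(f)\|_{L_p(\mu_j)} \le \theta_j(p,q) \|f\|_{L_q(\mu_j)}. \tag{26}
\]
Then for $j < k \le n$ and $f \in B(E)$ we have 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \theta_j(p,q)\, \gamma^{\frac{q-1}{q}}\, \delta(q) \|f\|_{L_q(\mu_k)}
\]
with $\delta(q)$ as defined in Corollary 4.6. 

Proof. By (26) we have 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \theta_j(p,q) \|g_{j,j+1}\, \hat q_{j+1,k}(f)\|_{L_q(\mu_j)}
\]
and thus by Corollary 4.6 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \theta_j(p,q)\, \gamma^{\frac{q-1}{q}} \|\hat q_{j+1,k}(f)\|_{L_q(\mu_{j+1})} \le \theta_j(p,q)\, \gamma^{\frac{q-1}{q}}\, \delta(q) \|f\|_{L_q(\mu_k)}.
\]
□ 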



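The following lemma shows that $L_p$-$L_q$-bounds for the $\hat q_{j,k}$ can be used to obtain $L_p$-$L_q$-bounds 
for the original mappings $q_{j,k}$. 

Lemma 4.8. Assume that for some $p \ge 1$ and $q \ge 1$ we have a $\delta > 0$ such that for all 
$f \in B(E)$ and for all $1 \le j < k \le n$ 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \delta \|f\|_{L_q(\mu_k)}.
\]
Then we have 
\[
\|q_{j,k}(f)\|_{L_p(\mu_j)} \le \gamma^{\frac{p-1}{p}}\, \delta \|f\|_{L_q(\mu_k)}.
\]

Proof. Note that we have 
\[
\begin{aligned}
\|q_{j,k}(f)\|_{L_p(\mu_j)} &= \mu_j\big(|g_{j,j+1}\, \hat q_{j+1,k}(K_k(f))|^p\big)^{\frac{1}{p}} \\
&\le \gamma^{\frac{p-1}{p}} \mu_{j+1}\big(|\hat q_{j+1,k}(K_k(f))|^p\big)^{\frac{1}{p}} \\
&\le \gamma^{\frac{p-1}{p}}\, \delta \|K_k(f)\|_{L_q(\mu_k)} \le \gamma^{\frac{p-1}{p}}\, \delta \|f\|_{L_q(\mu_k)},
\end{aligned}
\]
where in the last step we used that by Jensen's inequality $|K_k(f)|^q \le K_k(|f|^q)$ and that 
$\mu_k$ is a stationary distribution for $K_k$. □ 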

Combining Theorem 4.5 with Corollary 4.7 and Lemma 4.8 we immediately obtain the type 
of bound needed in the error bounds of Section 3. 

Corollary 4.9. Consider $p \ge 1$ and $q \ge 1$. Let $q \in [2^r, 2^{r+1}]$ for $r \in \mathbb{N}$ and assume 
$\alpha\gamma^{2^{r+1}-2} < 1$. Assume that for all $1 \le j \le n$ we have a constant $\theta(p,q) > 0$ such that 
\[
\|K_j(f)\|_{L_p(\mu_j)} \le \theta(p,q) \|f\|_{L_q(\mu_j)}.
\]
Then for all $1 \le j < k \le n$ and $f \in B(E)$ we have 
\[
\|q_{j,k}(f)\|_{L_p(\mu_j)} \le c_{j,k}(p,q) \|f\|_{L_q(\mu_k)}
\]
with 
\[
c_{j,k}(p,q) = \theta(p,q)\, \gamma^{\frac{q-1}{q}}\, \gamma^{\frac{p-1}{p}}\, \delta(q),
\]
where $\delta(q)$ is as defined in Corollary 4.6. 

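To round out the analysis of this section, we show that for functions $f$ with $\mu_k(f) = 0$ we 
can moreover show the following result of exponential decay of $\|\hat q_{j,k}(f)\|_{L_p(\mu_j)}$. 

Proposition 4.10. Let $p \ge 2$ with $p = 2^r$ for $r \in \mathbb{N}$. Assume that $\theta_p = \alpha\gamma^{p-2} < 1$. Then 
for $1 \le j < k \le n$ and $f \in B(E)$ with $\mu_k(f) = 0$ we have 
\[
\|\hat q_{j,k}(f)\|^p_{L_p(\mu_j)} \le \lambda_p\, \theta_p^{k-j} \|f\|^p_{L_p(\mu_k)},
\]
where the constants $\lambda_p$ can be calculated recursively from $\lambda_2 = 1$ and 
\[
\lambda_{2p} = 1 + \lambda_p^2 \left( \alpha \left( 1 - \frac{\alpha}{\gamma^2} \right) \right)^{-1}.
\]
Moreover, 
\[
\lambda_p \le \left( 2 \left( \alpha \left( 1 - \frac{\alpha}{\gamma^2} \right) \right)^{-1} \right)^{\frac{p}{2}-1}.
\]

Proof. From (22) and $\mu_k(f) = 0$ we immediately get the result for $p = 2$. Now we proceed 
inductively, concluding from a bound for $p$ a bound for $2p$. Assume thus $\theta_{2p} < 1$ and that we 
have 
\[
\|\hat q_{j,k}(f)\|^p_{L_p(\mu_j)} \le \lambda_p\, \theta_p^{k-j} \|f\|^p_{L_p(\mu_k)} \tag{27}
\]
for $\theta_p$ as defined above and for some $\lambda_p \ge 1$. Observe that $\theta_{2p} < 1$ implies immediately 
$\theta_p < 1$. From (25) and (27) and since we assumed $\lambda_p \ge 1$ we have the inequality 
\[
\|\hat q_{j,k}(f)\|^{2p}_{L_{2p}(\mu_j)} \le \theta_{2p}^{k-j} \|f\|^{2p}_{L_{2p}(\mu_k)} + \gamma^{2p-2} \lambda_p^2 \sum_{i=j+1}^{k} \theta_{2p}^{i-j-1}\, \theta_p^{2(k-i)} \|f\|^{2p}_{L_{2p}(\mu_k)}.
\]
Thus we have 
\[
\|\hat q_{j,k}(f)\|^{2p}_{L_{2p}(\mu_j)} \le \tilde\lambda_{2p}\, \theta_{2p}^{k-j} \|f\|^{2p}_{L_{2p}(\mu_k)}
\]
with 
\[
\tilde\lambda_{2p} = 1 + \lambda_p^2\, \gamma^{2p-2}\, \theta_{2p}^{-1} \sum_{i=j+1}^{k} \left( \frac{\theta_p^2}{\theta_{2p}} \right)^{k-i} = 1 + \lambda_p^2\, \gamma^{2p-2}\, \theta_{2p}^{-1} \sum_{i=0}^{k-j-1} \left( \frac{\theta_p^2}{\theta_{2p}} \right)^{i}.
\]
Observing that 
\[
\frac{\gamma^{2p-2}}{\theta_{2p}} = \frac{1}{\alpha} \qquad \text{and} \qquad \frac{\theta_p^2}{\theta_{2p}} = \frac{\alpha}{\gamma^2}
\]
and applying the geometric series inequality thus yields 
\[
\tilde\lambda_{2p} \le 1 + \lambda_p^2 \left( \alpha \left( 1 - \frac{\alpha}{\gamma^2} \right) \right)^{-1}.
\]
Choosing 
\[
\lambda_{2p} = 1 + \lambda_p^2 \left( \alpha \left( 1 - \frac{\alpha}{\gamma^2} \right) \right)^{-1}
\]
we have thus shown the desired decay inequality. Moreover, observe that, since $\gamma \ge 1$ and 
$\alpha < 1$, $\lambda_p \ge 1$ implies that $\lambda_{2p} \ge 1$ and thus we have $\lambda_p \ge 1$ for all $p = 2^r$. To show the 
upper bound on the coefficients $\lambda_p$, define 
\[
\kappa = 2 \left( \alpha \left( 1 - \frac{\alpha}{\gamma^2} \right) \right)^{-1}.
\]
Since $\kappa \ge 2$ and $\lambda_p \ge 1$, we have $\lambda_{2p} \le \kappa \lambda_p^2$. Since $\lambda_2 = 1$, this implies 
\[
\lambda_{2p} \le \kappa^{p-1}.
\]
□ 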
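The growth of the constants $\lambda_p$ can be inspected numerically. This is a minimal sketch under the assumption that the recursion reads $\lambda_2 = 1$, $\lambda_{2p} = 1 + \lambda_p^2 (\alpha(1-\alpha/\gamma^2))^{-1}$ and that the upper bound is $\kappa^{p/2-1}$ with $\kappa = 2(\alpha(1-\alpha/\gamma^2))^{-1}$; the helper names are hypothetical.

```python
def lambda_constants(alpha, gamma, r_max):
    """Return {p: lambda_p} for p = 2, 4, ..., 2**r_max via the doubling recursion."""
    c = 1.0 / (alpha * (1.0 - alpha / gamma ** 2))
    lam, p = 1.0, 2
    constants = {p: lam}
    for _ in range(r_max - 1):
        lam = 1.0 + lam ** 2 * c   # lambda_{2p} from lambda_p
        p *= 2
        constants[p] = lam
    return constants


def lambda_upper(alpha, gamma, p):
    """Upper bound kappa**(p/2 - 1) on lambda_p."""
    kappa = 2.0 / (alpha * (1.0 - alpha / gamma ** 2))
    return kappa ** (p / 2 - 1)
```

Note that the constants grow very quickly in $p$ when $\alpha$ is small, since the prefactor $(\alpha(1-\alpha/\gamma^2))^{-1}$ is then large; the exponential decay in $k-j$ comes entirely from the factor $\theta_p^{k-j}$.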

By the Riesz-Thorin interpolation theorem, Proposition 4.10 immediately generalizes to the 
case $p \ne 2^r$. Corollary 4.7 and Lemma 4.8 can be used to extend Proposition 4.10 to $L_p$-$L_q$-bounds 
and to the $q_{j,k}$. 




5. Error Bounds from Poincaré and Logarithmic Sobolev Inequalities 

In Section 5.1 we show that assuming (global) Poincaré inequalities associated with sufficiently 
large spectral gaps for the kernels $K_k$ is sufficient for guaranteeing that the results on 
$L_p$-stability of the Feynman-Kac propagator $q_{j,k}$ from Section 4.2 can be applied. In Section 
5.2 we then give a more explicit version of our error bound for Sequential MCMC in terms 
of the constants in Poincaré and Logarithmic Sobolev inequalities. We continue to work in 
the framework introduced in Section 4.1. 

5.1. Poincaré Inequalities and Stability of Feynman-Kac Propagators. Our $L_p$-stability 
results of Section 4.2 relied on Assumption C, namely, 
\[
\|\hat q_{k-1,k}(f)\|^2_{L_2(\mu_{k-1})} \le \alpha \|f\|^2_{L_2(\mu_k)} + \beta\, \mu_k(f)^2
\]
for coefficients $\alpha > 0$ and $\beta \in [0, 1]$, where 
\[
\hat q_{k-1,k}(f) = K_{k-1}(g_{k-1,k} f) \quad \forall f \in B(E),
\]
and on the Assumption B of an upper bound $\gamma$ on the normalized relative densities $g_{k-1,k}$. 
Additionally, we needed conditions assuming that $\alpha$ is sufficiently small. 

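In the following we relate Assumption C to Poincaré inequalities for the transition kernels 
$K_k$. We first show that Assumption C holds, provided that the following $L_2$-inequalities for 
the kernels are satisfied: 

Assumption D. For $1 \le k \le n$ and $\rho \in (0, 1)$ assume that for all $f \in B(E)$, 
\[
\mathrm{Var}_{\mu_k}(K_k(f)) \le (1-\rho)\, \mathrm{Var}_{\mu_k}(f). \tag{28}
\]

Lemma 5.1. Assume that Assumption D is satisfied for some $\rho \in (0,1)$ and Assumption B 
is satisfied for some $\gamma$. Then Assumption C holds with 
\[
\alpha = (1-\rho)\gamma \quad \text{and} \quad \beta = \rho.
\]

Proof. Note that we can write 
\[
\begin{aligned}
\|\hat q_{k-1,k}(f)\|^2_{L_2(\mu_{k-1})} &= \mu_{k-1}\big(K_{k-1}(g_{k-1,k} f)^2\big) \\
&= \mu_{k-1}\big(K_{k-1}(g_{k-1,k} f - \mu_{k-1}(g_{k-1,k} f))^2\big) + \mu_{k-1}(g_{k-1,k} f)^2.
\end{aligned}
\]
Thus by (28) we have 
\[
\begin{aligned}
\|\hat q_{k-1,k}(f)\|^2_{L_2(\mu_{k-1})} &\le (1-\rho) \big( \mu_{k-1}((g_{k-1,k} f)^2) - \mu_{k-1}(g_{k-1,k} f)^2 \big) + \mu_{k-1}(g_{k-1,k} f)^2 \\
&\le (1-\rho)\gamma\, \mu_k(f^2) + \rho\, \mu_k(f)^2,
\end{aligned}
\]
which proves the claim. □ 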
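Lemma 5.1 can be sanity-checked on a finite state space. The sketch below is an illustration under our own assumptions, not a construction from the text: it uses the hypothetical "lazy independence" kernel $K(f) = (1-c) f + c\,\mu(f)$, which is stationary for $\mu$ and satisfies $\mathrm{Var}_\mu(K f) = (1-c)^2 \mathrm{Var}_\mu(f)$, so that (28) holds with $\rho = 1-(1-c)^2$. It then tests the one-step inequality with $\alpha = (1-\rho)\gamma$ and $\beta = \rho$ on random test functions.

```python
import random


def expect(mu, f):
    """Integral of f against the probability vector mu."""
    return sum(m * x for m, x in zip(mu, f))


def lazy_kernel(mu, c):
    """K(f) = (1 - c) f + c mu(f): stationary for mu, variance contraction (1 - c)^2."""
    def K(f):
        mean = expect(mu, f)
        return [(1.0 - c) * x + c * mean for x in f]
    return K


def check_assumption_c(mu_prev, mu_next, c, trials=1000, seed=0):
    """Test ||K(g f)||^2_{L2(mu_prev)} <= alpha mu_next(f^2) + beta mu_next(f)^2."""
    rng = random.Random(seed)
    g = [b / a for a, b in zip(mu_prev, mu_next)]  # normalized relative density
    gamma = max(g)
    rho = 1.0 - (1.0 - c) ** 2
    alpha, beta = (1.0 - rho) * gamma, rho
    K = lazy_kernel(mu_prev, c)
    for _ in range(trials):
        f = [rng.uniform(-1.0, 1.0) for _ in mu_prev]
        qf = K([gi * fi for gi, fi in zip(g, f)])
        lhs = expect(mu_prev, [x * x for x in qf])
        rhs = alpha * expect(mu_next, [x * x for x in f]) + beta * expect(mu_next, f) ** 2
        if lhs > rhs + 1e-12:
            return False
    return True
```

Calling `check_assumption_c([0.5, 0.3, 0.2], [0.2, 0.3, 0.5], 0.4)` exercises the bound for a three-point space with $\gamma = 2.5$.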



We next show how the constant $\rho$ from (28) can be controlled in terms of lower bounds on 
the spectral gaps of the kernels $K_k$. We add one additional assumption for the remainder of 
Section 5.1: Assume that $K_k$ is reversible with respect to $\mu_k$, i.e., for all $f, g \in B(E)$ 
\[
\mu_k(g\, K_k(f)) = \mu_k(f\, K_k(g)).
\]
Reversibility is, for instance, fulfilled by construction for Metropolis chains. 

Lemma 5.2. Assume that for all $1 \le k \le n$ we have a $\lambda_k \in (0, 1)$ such that $K_k$ fulfills a 
Poincaré inequality with constant $\lambda_k$, 
\[
\lambda_k\, \mu_k(f^2) \le \mu_k(f\, (I - K_k)(f)) \tag{29}
\]
for all $f \in B(E)$ with $\mu_k(f) = 0$, where $I$ denotes the identity mapping on $E$. Then we have 
\[
\mu_k\big(K_k(f - \mu_k(f))^2\big) \le (1 - \lambda_k)^2\, \mathrm{Var}_{\mu_k}(f)
\]
for all $f \in B(E)$. In particular, Assumption D holds with 
\[
\rho = \min_k \big[ 1 - (1 - \lambda_k)^2 \big].
\]

Proof. By (29) we have for $f \in B(E)$ with $\mu_k(f) = 0$ and $f \ne 0$, 
\[
\frac{\mu_k(f\, K_k(f))}{\mu_k(f^2)} \le 1 - \lambda_k,
\]
and thus the spectrum of $K_k$ is bounded from above by $1 - \lambda_k$ on $\mathrm{span}(1)^\perp$. Consequently, 
the spectrum of $K_k^2$ is bounded from above by $(1 - \lambda_k)^2$ on $\mathrm{span}(1)^\perp$, i.e., 
\[
\frac{\mu_k(f\, K_k^2(f))}{\mu_k(f^2)} \le (1 - \lambda_k)^2.
\]
By the reversibility of $K_k$, this is equivalent to 
\[
\mu_k(K_k(f)^2) \le (1 - \lambda_k)^2\, \mu_k(f^2).
\]
To conclude the proof observe that thus for all $f \in B(E)$ 
\[
\mu_k\big(K_k(f - \mu_k(f))^2\big) \le (1 - \lambda_k)^2\, \mu_k\big((f - \mu_k(f))^2\big).
\]
□ 

Thus, assuming (28) is equivalent to assuming a Poincaré inequality. In the algorithm, $\rho$ can 
be controlled by varying the number of MCMC steps: A sufficiently large number of MCMC 
steps makes $\rho$ large and accordingly it makes $\alpha$ small. For future reference, we also give a 
version of Theorem 4.5 under the stronger assumption that (28) holds for some sufficiently 
large $\rho \in (0, 1)$. This follows immediately from the theorem by inserting the values of $\alpha$ and 
$\beta$ from Lemma 5.1. 
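To illustrate how the number of MCMC steps controls $\rho$, consider the following sketch. It rests on an assumption of ours that goes beyond the text: the kernel is the $m$-fold composition of a reversible kernel whose spectrum on $\mathrm{span}(1)^\perp$ lies in $[-(1-\lambda), 1-\lambda]$, so that the variance contracts by $(1-\lambda)^{2m}$ and $\rho_m = 1 - (1-\lambda)^{2m}$. The function then searches for the smallest $m$ such that $(1-\rho_m)\gamma^{p-1} < 1$, which is the condition $\alpha\gamma^{p-2} < 1$ with $\alpha = (1-\rho)\gamma$ from Lemma 5.1.

```python
def rho_after_steps(lam, m):
    """rho for the m-step kernel, assuming variance contraction (1 - lam)**(2m)."""
    return 1.0 - (1.0 - lam) ** (2 * m)


def min_steps(lam, gamma, p):
    """Smallest m >= 1 with (1 - rho_m) * gamma**(p - 1) < 1."""
    if not 0.0 < lam < 1.0:
        raise ValueError("spectral gap lam must lie in (0, 1)")
    if gamma < 1.0:
        raise ValueError("gamma bounds a normalized density, so gamma >= 1")
    m = 1
    while (1.0 - rho_after_steps(lam, m)) * gamma ** (p - 1) >= 1.0:
        m += 1
    return m
```

With $\lambda = 0.1$, $\gamma = 2$ and $p = 4$ the condition reads $0.81^m \cdot 8 < 1$, so the search stops at $m = 10$; the required number of steps grows linearly in $p \log\gamma$ and inversely in $-\log(1-\lambda)$.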



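Corollary 5.3. Consider $p = 2^r$ for $r \in \mathbb{N}$ and assume that (28) holds for some $\rho$ with 
$(1-\rho)\gamma^{p-1} < 1$. Then we have for $1 \le j < k \le n$ and $f \in B(E)$ the inequality 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \delta(p) \|f\|_{L_p(\mu_k)}
\]
with 
\[
\delta(p) = \prod_{i=1}^{r} \frac{\gamma^{1-2^{-(i-1)}}}{(1 - (1-\rho)\gamma^{2^i-1})^{2^{-i}}} \le \frac{\gamma^{r-2+2^{-(r-1)}}}{1 - (1-\rho)\gamma^{p-1}}.
\]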
5.2. Explicit Error Bounds. In this section we introduce a further parameter $t_k$ for our 
transition operators $K_k$ which is thought to be the running time or number of MCMC steps 
contained in $K_k$. We write $K_k^{t_k}$ in the following to make this dependence clear. Throughout 
this section, we assume two families of mixing conditions depending on $t_k$, a hypercontractivity 
inequality and an $L_2$-$L_2$ inequality: 

Assumption E. There exist positive constants $a_k^*$ and $b_k^*$ such that for all $f \in B(E)$ 
\[
\|K_k^{t_k}(f)\|_{L_{q(p,t_k)}(\mu_k)} \le \|f\|_{L_p(\mu_k)}, \tag{30}
\]
where $q(p, t_k) = 1 + (p-1) \exp(2 a_k^* t_k)$, and 
\[
\|K_k^{t_k}(f) - \mu_k(f)\|^2_{L_2(\mu_k)} \le \exp(-2 b_k^* t_k)\, \|f - \mu_k(f)\|^2_{L_2(\mu_k)}. \tag{31}
\]

Inequalities of this type follow, respectively, from a Logarithmic Sobolev inequality and a 
Poincaré inequality for the underlying MCMC dynamics, see, e.g., Deuschel and Stroock 
[10] or Ané et al. [1]. Furthermore we assume again boundedness of relative densities, i.e., 
Assumption B. All proofs are at the end of the section. 

We have the following bounds for $q_{j,k}$ provided that the running times $t_j, \ldots, t_k$ are chosen 
sufficiently large. 

Proposition 5.4. Fix $0 \le j < k \le n$, $\tau \in (0,1)$, $1 \le s \in \mathbb{N}$ and $p = 2^s$. Assume that for 
$j \le l \le k$ 
\[
t_l \ge \frac{1}{2 b_l^*} \big[ (p-1) \log(\gamma) - \log(1-\tau) \big]. \tag{32}
\]
Then we have for $f \in B(E)$ 
\[
\|q_{j,k}(f)\|_{L_p(\mu_j)} \le c_{j,k}(p,p) \|f\|_{L_p(\mu_k)}
\]
with 
\[
c_{j,k}(p,p) = \frac{\gamma^{s-1+1/p}}{\tau}.
\]
If in addition we have for $p' > p$ and $j \le l \le k$, 
\[
t_l \ge \frac{1}{2 a_l^*} \big[ \log(p'-1) - \log(p-1) \big], \tag{33}
\]
then we have for $f \in B(E)$ 
\[
\|q_{j,k}(f)\|_{L_{p'}(\mu_j)} \le c_{j,k}(p',p) \|f\|_{L_p(\mu_k)}
\]
with 
\[
c_{j,k}(p',p) = \frac{\gamma^{s+1/p-1/p'}}{\tau}.
\]



Here and in the following, it is straightforward to relax the requirement of $p = 2^s$, see 
Corollary 4.6. The constant $\tau$ controls the contractivity of the MCMC steps in the 
following sense: In order to apply our bounds of Section 5.1, namely, Corollary 5.3, we 
need to ensure that 
\[
0 < 1 - \gamma^{p-1}(1 - \rho) = 1 - \gamma^{p-1} e^{-2 b_l^* t_l}.
\]
$\tau$ is a uniform measure of by how much this inequality is satisfied, i.e., $\tau$ is assumed to be a 
constant with 
\[
\tau \le 1 - \gamma^{p-1} e^{-2 b_l^* t_l} \quad \text{for all } j \le l \le k.
\]
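The running-time conditions can be turned into a small calculator. This is a sketch assuming that (32) reads $t_l \ge \frac{1}{2 b_l^*}[(p-1)\log\gamma - \log(1-\tau)]$, that (33) reads $t_l \ge \frac{1}{2 a_l^*}[\log(p'-1) - \log(p-1)]$, and that $c_{j,k}(p,p) = \gamma^{s-1+1/p}/\tau$; the function and argument names are our own.

```python
import math


def min_time_l2(b_star, gamma, p, tau):
    """Smallest t with 1 - gamma**(p-1) * exp(-2 * b_star * t) >= tau, cf. (32)."""
    return ((p - 1) * math.log(gamma) - math.log(1.0 - tau)) / (2.0 * b_star)


def min_time_hyper(a_star, p_prime, p):
    """Smallest t with p_prime <= 1 + (p - 1) * exp(2 * a_star * t), cf. (33)."""
    return (math.log(p_prime - 1) - math.log(p - 1)) / (2.0 * a_star)


def c_pp(gamma, tau, s):
    """L_p-L_p constant gamma**(s - 1 + 1/p) / tau for p = 2**s."""
    p = 2 ** s
    return gamma ** (s - 1 + 1.0 / p) / tau


# Example: gamma = 2, tau = 0.8, p = 4, p' = 8, unit mixing constants.
t_needed = max(min_time_l2(1.0, 2.0, 4, 0.8), min_time_hyper(1.0, 8, 4))
```

At the threshold returned by `min_time_l2` the quantity $1 - \gamma^{p-1} e^{-2 b^* t}$ equals $\tau$ exactly, which is the sense in which $\tau$ measures the slack in the contraction condition.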
Our next step is to utilize the constants $c_{j,k}(p',p)$ in order to bound the constants that arise 
in the error bound of Theorem 3.2. 

Corollary 5.5. Fix $0 \le j < k \le n$, $\tau \in (0, 1)$, $2 \le s \in \mathbb{N}$ and $p = 2^s$. Assume that for 
$1 \le l \le n$, 
\[
t_l \ge \frac{1}{2 b_l^*} \big[ (2p-1) \log(\gamma) - \log(1-\tau) \big] \tag{34}
\]
and 
\[
t_l \ge \frac{1}{2 a_l^*} \left[ \log(p-1) - \log\left( \frac{p}{2} - 1 \right) \right]. \tag{35}
\]
Define 
\[
h(p) = \frac{\gamma^{2s+\frac{1}{p}}}{\tau^2}.
\]
Then we have 
\[
c_{j,k}(p) \le h(p).
\]
Furthermore 



Ckip) <Ck{p) <{{! + 7)^3) k hip), and Mp) <Mp) < (k + l)^- 
Adding the final observations that for $f \in B(E)$ and $0 \le j < n$, 
\[
\mathrm{Var}_{\mu_j}(q_{j,n}(f)) \le c_{j,n}(2,2)^2\, \|f\|^2_{L_2(\mu_n)}
\]
and 
\[
\mathrm{Var}_{\mu_n}(q_{n,n}(f)) \le \|f\|^2_{L_2(\mu_n)},
\]
we are now in the position to bound all the terms in the error bound of Theorem 3.2 through 
$\gamma$, $\tau$, $p$, $N$ and $n$. Thus we arrive at the following version of the corollary: 

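Corollary 5.6. Fix $0 < n \in \mathbb{N}$, $\tau \in (0, 1)$, $2 \le s \in \mathbb{N}$ and $p = 2^s$. Assume that for $1 \le l \le n$ 
\[
t_l \ge \frac{1}{2 b_l^*} \big[ (2p-1) \log(\gamma) - \log(1-\tau) \big]
\]
and 
\[
t_l \ge \frac{1}{2 a_l^*} \left[ \log(p-1) - \log\left( \frac{p}{2} - 1 \right) \right].
\]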

Then for f G B{E) we have 

iVE[|z.„^(/)-/.„(/)|2] 



< 



1 + n7T-2 + n((l + 7) V 3)7^^+^ V-^e^'P 



If in addition 
then we have 



N > 2((1 + 7) V 3)n72^+P V'^, 



e^'P < 2 



1 + n^T 



(36) 
Finally, for the sake of illustration we also state these bounds for a concrete choice of parameters, 
namely $\gamma = 2$, $\tau = 0.8$, $p = 4$ and thus $s = 2$. After rounding the coefficients to improve 
readability (in a way that makes the inequality slightly worse) and inserting the bound on 
$\varepsilon_n^{N,p}$ this yields the bound 
\[
\frac{1 + 4n}{N} + \frac{180 n + 560 n^2}{N^2}.
\]
The required lower bound (36) on $N$ is given by $N \ge 180 n$ in this case. 
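The rounded coefficients in this illustration can be reproduced with a few lines of arithmetic. The sketch assumes that the coefficient 4 arises from $\gamma/\tau^2$, the coefficient 180 from $2((1+\gamma)\vee 3)\,\gamma^{2s+1/p}/\tau^2$, and the coefficient 560 from the product of the two; this attribution is our reading of the preceding bounds rather than a statement taken from the text.

```python
def h(gamma, tau, s):
    """h(p) = gamma**(2s + 1/p) / tau**2 for p = 2**s."""
    p = 2 ** s
    return gamma ** (2 * s + 1.0 / p) / tau ** 2


gamma, tau, s = 2.0, 0.8, 2

var_coeff = gamma / tau ** 2                           # about 3.125, rounded up to 4
level_coeff = max(1.0 + gamma, 3.0) * h(gamma, tau, s)  # about 89.2
n_min_coeff = 2.0 * level_coeff                        # about 178.4, rounded up to 180
quad_coeff = n_min_coeff * var_coeff                   # about 557.4, rounded up to 560
```

Under these assumptions each rounded value is an upper bound on the corresponding exact coefficient, which is consistent with the remark that rounding makes the inequality slightly worse.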

Proof of Proposition 5.4. Comparing inequalities (28) and (31) shows that the terms $\exp(-2 b_l^* t_l)$ 
play the role of $(1 - \rho)$ in the setting of Section 5.1. Thus, assuming for all $j \le l \le k$ 
\[
1 - e^{-2 b_l^* t_l} \gamma^{p-1} \ge \tau
\]
or, equivalently, (32) ensures that we can apply Corollary 5.3 to obtain the bound 
\[
\|\hat q_{j,k}(f)\|_{L_p(\mu_j)} \le \delta(p) \|f\|_{L_p(\mu_k)} \tag{37}
\]
with 
\[
\delta(p) = \frac{\gamma^{s-2+2/p}}{\tau}.
\]
Now applying Lemma 4.8 allows us to conclude from this bound for $\hat q_{j,k}$ the desired $L_p$-$L_p$-bound 
for $q_{j,k}$ with 
\[
c_{j,k}(p,p) = \gamma^{\frac{p-1}{p}}\, \frac{\gamma^{s-2+2/p}}{\tau} = \frac{\gamma^{s-1+1/p}}{\tau}.
\]
We next turn to the $L_{p'}$-$L_p$-bound. By (30), ensuring 
\[
p' \le 1 + (p-1) e^{2 a_l^* t_l}
\]
or, equivalently, (33) is a sufficient condition for 
\[
\|K_l^{t_l}(f)\|_{L_{p'}(\mu_l)} \le \|f\|_{L_p(\mu_l)}.
\]
Thus we conclude from applying first Corollary 4.7 and then Lemma 4.8 to (37) the desired 
$L_{p'}$-$L_p$-bound for $q_{j,k}$ with 
\[
c_{j,k}(p',p) = \gamma^{\frac{p'-1}{p'}}\, \frac{\gamma^{s-1+1/p}}{\tau} = \frac{\gamma^{s+1/p-1/p'}}{\tau}.
\]
□ 

Proof of Corollary 5.5. Choose $c_{j,k}(p',p)$ as in Proposition 5.4. Since 
\[
c_{j,k}(p) = \max\left( c_{j,k}\left(p, \frac{p}{2}\right),\; c_{j,k}(2p,p)^2 \right),
\]
by Lemma 4.1 we need to apply Proposition 5.4 for $(p, p/2)$ and $(2p, p)$. Note that if inequality 
(32) holds for $2p$ it also holds for $p$. Conversely, if (33) holds for $(p, p/2)$ it also holds for 
$(2p, p)$ since 
\[
\frac{2x-1}{x-1} = 2 + \frac{1}{x-1}
\]
is decreasing in $x$. This motivates our assumption of (34) and (35). Now observe that 
\[
1 \le c_{j,k}\left(p, \frac{p}{2}\right) \le c_{j,k}(2p,p)^2 = \frac{\gamma^{2s+\frac{1}{p}}}{\tau^2} = h(p)
\]
and thus we have 
\[
c_{j,k}(p) \le h(p).
\]
Since for all $x \in E$ we have 
\[
-1 \le q_{j,j+1}(1)(x) - 1 = g_{j,j+1}(x) - 1 \le \gamma - 1,
\]

it follows that 

2+||(?,-,+i(l)-l||i^(^^,) < ((1+7)V3) 
and we can bound Ck{p) as follows: 

k-l 

Ck{p) = Yl Ci,fe(p)(2 + \\qj,j+i{l){x) - < ((1 + 7) V 3) h{p) k. 

3=0 

Since this upper bound is monotonically increasing in /c, it also applies to c^ip)- We now 
turn to Vk{p). Observe that we have 



Vk 



(p) = sup I J2j=o ^^^f:> (^jMf)) 
= SUp|^^^^^/ij(gj-fe(/) 



/Gi?(i?),||/||i^(^^)<l 
/Gi?(i?),||/||z.,(^,)<l 



k 
j=0 



As we have 



r 



we get the desired upper bound on Vk{p)- Since this upper bound is increasing in k, it also 
applies to Vk (p) ■ □ 

6. Dimension Dependence 

A particular advantage of our error bounds is that they allow for deriving the algorithm's 
dimension dependence fairly explicitly for the case where the measures $\mu_k$ are product measures 
on $\mathbb{R}^d$. This can be seen as a first step to understanding the algorithm's overall dimension 
dependence. 

We demonstrate that if we have a one-dimensional setting where our bounds apply, then we 
can obtain bounds of the same order for the $d$-dimensional product of the one-dimensional 
target distribution by increasing the computational effort by a factor of order $O(d^3)$ and thus 
by a factor which is polynomial in $d$. Consider a sequence of distributions $\mu_0, \ldots, \mu_n$ on $\mathbb{R}$ 
such that relative densities are bounded by $\gamma$ and such that we can obtain sufficiently good 
mixing properties for the dynamics $K_k$ from Logarithmic Sobolev inequalities. The constants 
in Logarithmic Sobolev inequalities are not dimension-dependent for product measures, see 
Ané et al. [1]. Thus, we obtain the same constants in the mixing conditions for the 
$d$-dimensional product dynamics $K_k^{\otimes d}$ with target $\mu_k^{\otimes d}$ as for the one-dimensional dynamics $K_k$. 
Since the $d$-dimensional relative densities $g_{k,k+1}^{\otimes d}$ are $d$-fold products of the one-dimensional 
densities $g_{k,k+1}$, we need to increase the number of interpolating distributions by a factor $d$ 
when switching from $\mathbb{R}$ to $\mathbb{R}^d$. This can be done by inserting $d - 1$ additional distributions 
$\mu^d_{k,1}$ to $\mu^d_{k,d-1}$ between $\mu_k^{\otimes d}$ and $\mu_{k+1}^{\otimes d}$ where, for $1 \le j < d$, $\mu^d_{k,j}$ is given by 
\[
\mu^d_{k,j}(dx_1, \ldots, dx_d) = \left[ \prod_{i=1}^{j} g_{k,k+1}(x_i) \right] \mu_k^{\otimes d}(dx_1, \ldots, dx_d).
\]
Note that $\mu^d_{k,0} = \mu_k^{\otimes d}$ and $\mu^d_{k,d} = \mu_{k+1}^{\otimes d}$. Moreover, the relative densities between $\mu^d_{k,j}$ and 
$\mu^d_{k,j+1}$ are bounded by $\gamma$. Assume furthermore that the Logarithmic Sobolev constant for the 
$d - 1$ interpolating measures "inserted" between $\mu_k^{\otimes d}$ and $\mu_{k+1}^{\otimes d}$ lies in between the Logarithmic 
Sobolev constants for $\mu_k^{\otimes d}$ and $\mu_{k+1}^{\otimes d}$. Thus MCMC with respect to the inserted distributions 
is not more difficult than MCMC with respect to the original distributions. Now re-index the 
sequence of $n^{(d)} = n \cdot d + 1$ product measures on $\mathbb{R}^d$ and denote the associated 
transition kernels and relative densities by $K^{(d)}_j$ and $g^{(d)}_{j,j+1}$. Then, we obtain 
dimension-independent constants $c_{j,k}(p)$ in the inequalities (13) for the propagator $q^{(d)}_{j,k}$ derived from $K^{(d)}_j$ 
and $g^{(d)}_{j,j+1}$. (Recall that our bounds on the constants $c_{j,k}$ from Section 5 were independent of $j$ 
and $k$.) Notably, these constants coincide with those derived for the original one-dimensional 
sequence $\mu_0, \ldots, \mu_n$, and the final distribution of the re-indexed sequence is $\mu_n^{\otimes d}$. 
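The effect of inserting the intermediate product measures can be seen on a toy example. The sketch below is our own illustration on a finite state space instead of $\mathbb{R}$: it builds the measure whose first $j$ coordinates follow the next one-dimensional distribution and whose remaining coordinates follow the current one, and compares relative densities between neighbouring levels with the relative density of the full $d$-dimensional jump. All names are hypothetical.

```python
from itertools import product


def interpolating_measure(mu_k, mu_k1, d, j):
    """Product measure on states**d with g(x_i) applied to the first j coordinates."""
    g = [b / a for a, b in zip(mu_k, mu_k1)]  # one-dimensional relative density
    measure = {}
    for x in product(range(len(mu_k)), repeat=d):
        weight = 1.0
        for i, xi in enumerate(x):
            weight *= mu_k[xi] * (g[xi] if i < j else 1.0)
        measure[x] = weight
    return measure


def max_relative_density(nu, mu):
    """Supremum of d(nu)/d(mu) over the finite state space."""
    return max(nu[x] / mu[x] for x in mu)


mu_k, mu_k1, d = [0.5, 0.5], [0.25, 0.75], 3
levels = [interpolating_measure(mu_k, mu_k1, d, j) for j in range(d + 1)]
step_bounds = [max_relative_density(levels[j + 1], levels[j]) for j in range(d)]
direct_bound = max_relative_density(levels[d], levels[0])
```

Here each neighbouring pair has relative density bounded by $\gamma = 1.5$, whereas jumping directly from the first to the last level would require the bound $\gamma^d = 3.375$; this is why the number of levels is scaled by $d$.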

Overall, we then find that the computational effort needed to maintain the error bound grows 
with the dimension like $O(d^3)$ in this example: We need one factor $d$ since each step of the 
product dynamics is more costly to simulate, one factor $d$ due to the fact that we increase the 
number of levels by a factor $d$, and one factor $d$ because we have to adjust the number of 
particles $N$ by the same factor as the number of levels $n$, as can be seen from Theorem 3.2. It is 
an open question whether similar results can be derived from minorization and drift conditions 
as in Whiteley [22]. 